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Abstract. Let K ^ I he a, parameter. A i^-approximate group is a finite set A in a (local) group 
which contains the identity, is symmetric, and such that A ■ A is covered by K left translates of A. 

The main result of this paper is a qualitative description of approximate groups as being essentially 
finite-by-nilpotent, answering a conjecture of H. Helfgott and E. Lindenstrauss. This may be viewed 
as a generalisation of the Preiman-Ruzsa theorem on sets of small doubling in the integers to arbitrary 
groups. 

We begin by establishing a correspondence principle between approximate groups and locally 
compact (local) groups that allows us to recover many results recently established in a fundamental 
paper of Hrushovski. In particular we establish that approximate groups can be approximately 
modeled by Lie groups. 

To prove our main theorem we apply some additional arguments essentially due to Gleason. These 
arose in the solution of Hilbert's fifth problem in the 1950s. 

Applications of our main theorem include a finitary refinement of Gromov's theorem, as well as 
a generalized Margulis lemma conjectured by Gromov and a result on the virtual nilpotence of the 
fundamental group of Ricci almost nonnegatively curved manifolds. 



Resume. Soit X ^ 1 un parametre. On appelle groupe ii'-approximatif toute partie finie A d'un 
groupe (ou d'un groupe local) qui est symetrique, contient I'identite et est telle que A ■ A pent etre 
recouvert par au plus K translates a gauche de A. 

he resultat principal de cet article montre que tout groupe approximatif est, grossierement, fini- 
par-nilpotent, ce qui repond par I'affirmative a une conjecture de H.Helfgott et de E. Lindenstrauss. 
On pent interpreter ce theoreme comme une generalisation a un groupe quelconque du theoreme de 
Freiman-Ruzsa sur la structure des parties finics a petit doublcmcnt du groupe additif des cntiers 
relatifs. 

Nous commengons par etablir un principe de correspondence entre les groupes approximatifs et les 
groupes (ou groupes locaux) localement compacts, puis nous en deduisons de nombreuses consequences 
issues d'un important article recent de Hrushovski. En particulier, nous montrons que tout groupe 
approximatif peut-etre represente par un groupe de Lie. 

Pour demontrer notre theoreme principal, nous appliquons des arguments, en substance dus a 
Gleason pour la plupart, qui ont vus le jour dans le contexte de la solution du cinquicmc probleme 
de Hilbert dans les annees 50. 

En guise d'application du theoreme principal, nous montrons une version fine du theoreme de 
Gromov, ainsi qu'un lemme de Margulis generalise conjecture par Gromov, et un resultat sur la 
presque nilpotence des groupes fondamentaux des varietes a courbure de Ricci presque positive. 
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1. Introduction 

Approximate groups. A fair proportion of the subject of additive combinatorics is concerned 
with approximate analogues of exact algebraic properties, and the extent to which they resemble those 
algebraic properties. In this paper we are concerned with sets that are approximately closed under 
multiplication, which we do not necessarily assume to be commutative, and more specifically with 
approximate groups. These are finite non-empty sets A with group-like properties which we shall state 
precisely later. First we will motivate the definition of an approximate group with some discussion and 
examples. 

Suppose first of all that A is a finite subset of some ambient group G = (G, •). This is the setting 
considered in essentially all of the existing literature, and the one of importance in applications. How- 
ever, as we shall see later, our method of proof is in fact more naturally adapted to a more general 
setting, in which A lies in a local group rather than a global one. 

It is easy to see that a finite non-empty subset A of G is a genuine subgroup if, and only if, we have 
xy~^ € A whenever x^y G A. Perhaps the most natural way in which a set A may be approximately 
a subgroup, then, is if the set A ■ A^^ := {xy^^ : x,y E A} has cardinality not much bigger than the 
cardinality of A: for example, we might ask that \A ■ A^^ \ ^ K\A\ for some constant K. 

Sets with this property or with the closely related property ^ ^^^1^1, where A^ := A- A = {xy : 
x,y E A}, are said to have small doubling, and this is indeed a commonly encountered condition in 
various fields of mathematics, in particular in additive combinatorics. It is a perfectly workable notion 
of approximate group in the abelian setting and the celebrated Freiman-Ruzsa theorem. Theorem 12.11 
below, describes subsets of Z with this property. However in (51] it was noted that in noncommutative 
settings a somewhat different, though closely related, notion of approximate group is more natural: 
A is an approximate group if it is symmetric in the sense that the identity id lies in A, ii a^^ E A 
whenever a E A, and ii A ■ A is covered by K left-translates of A. 
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As suggested above we consider in this paper a slightly more general (and perhaps more natural, 
in retrospect) "local" definition of approximate group in which there is no ambient global group G. 
It will be convenient to introduce the following definition. This requires the concept of a local group, 
which is discussed at some length in Appendix [Bl 

Definition 1.1 (Multiplicative set). A multiplicative set is a finite non-empty set A contained in a 
(symmetric) local group G — (G, •), such that the product set [A U ^^i)200 well-defined, where 
:— {a^^ : a G A} is the inverse of A. Strictly speaking, one should refer to the pair [A, G) as the 
multiplicative set rather than just A, but we will usually abuse notation and omit the ambient local 
group G. 

In some (abelian) examples, we will use additive group notation G = (G, +) rather than multiplica- 
tive notation G — (G, •). In such cases, we will refer to multiplicative sets as additive sets instead. 

Clearly, any finite non-empty subset of a (global) group G is a multiplicative set. The reader should 
probably keep this model case in mind throughout a first reading of this paper. Indeed the additional 
generality afforded by the local setting is only needed at a single, albeit critical, place in the argument 
in Section |9l One should informally think of a multiplicative set A as a set that behaves "as if" it were 
in a global group, so long as one only works "locally" in the sense that one only considers products of 
up to 200 elements of A and their inverses. The exponent 200 in Definition 1 1.11 is somewhat arbitrary, 
but for the purposes of studying approximate groups, the exact choice of this exponent is not important 
in practice, so long as it is at least 8 (see Theorem l5.3l for a precise formalisation of this assertion). For 
the reader familiar with Freiman homomorphisms (cf. 53, §5.3]), we remark that these are essentially 
the morphisms in the category of multiplicative sets. 

Definition 1.2 (Approximate groups). Let K ^ 1. Pl K -approximate group is a multiplicative set A 
with the following properties: 

(i) the set A is symmetric in the sense that id € ^ and a^^ ^ A ii a E A; 

(ii) there is a symmetric subset X C A^ with \X\ ^ K such that A ■ A C X ■ A. 

We will sometimes refer to actual (global) groups as genuine groups, in order to distinguish them from 
approximate groups. We define a global K -approximate group to be a iiT- approximate group A that 
lies inside a global group G. We refer to K as the covering parameter of the approximate group A. 

Remark 1.3. We will also have occasion to deal with infinite ii'-approximate groups, which are defined 
exactly as ordinary ii"- approximate groups, except that they are no longer required to be finite sets. A 
convex body in a Euclidean space, or a small ball in a Lie group, are examples of infinite approximate 
groups. Later we will introduce the important notion of an ultra approximate group, which is another 
example. However, by default, approximate groups in this paper will be understood to be finite unless 
otherwise stated. 

The connection between sets with small doubling and the apparently stronger property of being an 
approximate group was worked out in [51] , building on work of Ruzsa [44] ; see Remark 11.51 below. 

When we speak of an "approximate group" we shall generally imagine that K is fixed (e.g. K = 10) 
and that \A\ is large. Let us give some examples. 



4 



EMMANUEL BREUILLARD, BEN GREEN, AND TERENCE TAO 



Example 1 (Finite group). A 1-approximate group is the same thing as a finite group. 

Example 2 (Arithmetic/geometric progression). If e N is a natural number, then the arithmetic 
progression P(l; A^) := {— A^, • • ■ , A^} (which one can view inside the (additive) global group Z, or the 
local group {— 200A^, . . . , 200A^}) is a 2-approximate group. More generally, if G = (G, •) is any (global) 
group and g £ G then the geometric progression P{g, N) :— {g^^ , ■ ■ ■ ,g^} is a 2-approximate group. 

Example 3 (Generalised arithmetic progression). Let G = (G, +) be an abelian group, let Mi, . . . , G G 
for some r ^ 0, and let A^i, . . . , A^^ > be real numbers. We refer to the set 

P(ui, . . . ,Ur; Ni, . . . , Nr) := {niUi + • • • + rirUr : ni, . . . , n-r e Z; |ni| ^ A^i, . . . , \nr\ ^ A^^} 

as a generalised arithmetic progression of rank r. One easily verifies that this is a 2''-approximate 
group. 

Example 4 (Homomorphic images). Let : G — )• iJ be a homomorphism between local or global 
groups. If yl is a AT-approximate subgroup of G, then (f>{A) is a AT-approximate subgroup of H. This 
observation can be generalised to the case when </> is a Freiman homomorphism (of order 3) rather than 
a group homomorphism; see |53[ §5.3] for more discussion. Indeed, Freiman homomorphisms are very 
similar to homomorphisms of local groups, although for technical reasons we will rely on the latter 
concept rather than the former. 

Conversely, if i? is a AT-approximate subgroup of H, (f) is surjective, and ker((/)) is finite, then (j)~^{B) 
is a A'-approximate subgroup of G. In the latter case one can view the A'-approximate group (j)^^{B) 
as a "finite extension" of the AT-approximate group B by the genuine group ker(0). 

Example 5 (Large subsets). Let A be a A'-approximate group, and let A' be a symmetric neighbourhood 
of the identity in A such that A is covered by K' left-translates of A' . Then A' is a A' AT'- approximate 
group. This hints that approximate groups are considerably more numerous than genuine groups, 
because the latter property is preserved under passage to "large" subsets, whereas the former is not. 

Example 6 (Heisenberg example) . Let G be the free nilpotent group of step 2 generated by two gener- 
ators ui,U2- More concretely, one can take G to be the Heisenberg group 

(1.1) G:=(l\l) 

Vo 1 / 

with generators 

/iio\ , /ioo\ 
ui 1 and := o i i . 
Vooi/ Vooi/ 

Consider also the commutator 

r 1 -1-1 /^i l^ 

[Wl,M2j:=Mi ^2 WlU2=(010j; 

one has 

Vo 1 / 

for all integers ni,n2,ni2. 

Let Ni,N2 ^ 10 be real numbers. Define the nilprogression P{ui,U2'-, Ni, N2) to be the set of all 
words in ui, u^^, W2, that involve at most A^i occurrences of ui,Ui^ and at most A^2 occurrences of 
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U2, U2 ■ It is not difficult to verify tliat P{ui, U2; Ni, N2) is a symmetric neiglibourlrood of tlie identity 
which contains the set 



One can easily verify that P{ui,U2', Ni, N2) is a ii'-approximate group for some absolute constant K 
(for instance, one could take K — 100). 

Remark 1.4. The above example was constructed inside the Heisenberg group. Later on we will 
discuss a generalisation of this example to arbitrary nilpotent groups. These examples, which we will 
call nilprogressions, will be needed to state the precise version of our main theorem (Theorem I2.10p 
below. We will define them later in this introduction. 

Example 7 (Direct products). The direct product of a i^i-approximate group and a ii'2-approximate 
group is a ifi_ftr2-approximate group, and so one may build up examples of approximate groups using 
both subgroups and nilprogressions. 

Example 8 (Helfgott's example). The following example of HelfgotlQ is a less obvious way of combining 
a subgroup and a nilprogression. 

Let A C GL3(Fp) be the following set of 3 x 3 matrices: 



Here, r, s € are fixed and N is large yet much smaller than p. Then A is a 0(l)-approximate group. 

Note that A has the following form: it admits a subgroup H, normalised by A, such that A/H is a. 
a geometric progression. Indeed 



In the language of Example |4l ^ is a finite extension of a geometric progression by the finite group H. 

Each of the above examples was rather "algebraic" in nature, whereas the definition of approxi- 
mate group is somewhat combinatorial. We also have some more combinatorial criteria for generating 
approximate groups using sets of small doubling or tripling. 

Remark 1.5 (Relationship between small doubling and approximate groups). Let A be a non-empty 
finite subset of a global group G. If \A^\ < K\A\, then the set H -.^ {AU {id} U A-^)'^ a 0(XO(i))- 
approximate group that contains A; see (51] Theorem 3.9]. In a similar vein if ^ K\A\ or 
\A ■ ^ K\A\, then there exists a 0(if'^(^))-approximate group H of size \H\ = 0{K°^'^'>\A\) 

such that A can be covered by 0{K'-"-^^) left-translates gH of H; see [51] Theorem 4.6]. 

^See terrytao . wordpress . coiii/2009/06/21/f reimans-theorem-f or-solvabl6-groups/#coimiient-39705. 
■^Here and in the rest of the paper we use X = O/f (^), ^ Y, or Y 2>k X for two (standard) quantities X, Y and a 
(standard) parameter K to denote the assertion that \X\ ^ CkY for some (standard) quantity Ck > depending only 
on K, and similarly for other choices of subscripted parameters. We also adopt an analogous notation for nonstandard 
quantities; see Appendix [Al 



K^u^=K,U2]"^^ : |niK7Vi/10,|7i2K A^2/10,|ni2KA^iA^2/10} 



and is contained in the set 



{<iu^^[ui,W2]"^= : \ni\ < lOTVi, \n2\ ^ ION: 



2,|ni2| < 10A^iiV2}. 
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Our aim in this paper is to "describe" the structure of approximate subgroups in an arbitrary 
ambient group in terms of more expHcit algebraic objects such as those hsted in the examples. Here is 
one form of our main result in this regard. 

Theorem 1.6 (Main theorem, simple form). Let A be a global K -approximate group, thus it is con- 
tained in a (global) group G. Then there exists a subgroup Go of G and a finite normal subgroup H of 
Go with the following properties: 

(i) A can be covered by Ox(l) left-translates of Go! 

(ii) Go/H is nilpotent and finitely generated of ranf^ and step at most Ox(l),.' 

(iii) contains H and a generating set of Go- 

In particular, the group Go is finite-by-nilpotent, and hence also virtually nilpotent. Indeed, the 
stabiliser in Go of the conjugation action on H has finite index in Go and is a central extension of a 
finite index subgroup of Go/H, and therefore is also nilpotent. 

By specialising Theorem 11.61 to the combinatorial examples in Remark 11.51 we obtain an analogous 
structure theorem for sets of small doubling. 

Corollary 1.7 (Freiman-type theorem). Let A and B be finite non-empty subsets in a [global) group 
G such that \ AB\ ^ iir|yl| 5 . Then there exists a subgroup Go of G and a finite normal subgroup 
H of Go with the following properties: 

(i) A can be covered by at most Ok{^) right translates of Gq; 

(ii) Go/H is nilpotent and finitely generated of rank and step Oa'(1)- In particular, Go is finite- 
by-nilpotent and hence also virtually nilpotent. 

Proof. By [Si] Theorem 4.6], there exists a 0(iir'^(^^)-approximate group A' of size ©(X"^^^' |A|) such 
that A can be covered by 0{K'^^-^'>) right translates of A' and B can be covered by 0{K'-'^^^) left 
translates of A' . We may thus apply Theorem 11.61 to A' . □ 

Theorem ll.6l (or Corollarv ll.71) answers in the afhrmative a conjecture that we have been referring to 
as the Helfgott- Lindenstrauss Conjecture, on account of its having been raised independently in private 
communications by both Harald Helfgott and Elon Lindenstrauss. In fact, the conjecture is reasonably 
explicit in the comments surrounding |30i Theorem 1.1]. 

Remark 1.8 (The linear case). Various forms of the main theorem are also known in groups of Lie type 
of bounded dimension, as a consequence of results of many authors [?, El [7l [HI [HI [291 1301 1321 142] . 
For instance, in [17] an analogue of Theorem 11.61 was established in the case when G is a solvable 
algebraic group of bounded dimension over a finite field of prime order. In that case, the group Go/H 
has bounded rank, and the number of cosets of Go needed to cover A is polynomial in K. We have 
no examples to rule out the possibility that this polynomiality in K holds in all groups G, perhaps at 
the cost of weakening the rank and step bounds on Go/H. Unfortunately our methods, which rely on 
ultrafi-lter arguments, give no quantitative bounds on the covering number whatsoever. 



The rank of a finitely generated group is the least number of generators required to generate the group. The step is the 
length of the lower central scries, minus 1. 
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Remark 1.9 (Bounds on the nilpotent group). Our method allows us to give an explicit bound on the 
dimension (rank and step) of the nilpotent group Gq/H in Theorem ll.6l at the expense of replacing in 
item (iii) by a larger power of A. Namely, if we allow for H and the generating set of Gq to be contained 
in A^^, then we may ensure that the nilpotent group Go/H is £-nilpotent with £ = 0{K^ XogK). If we 
are happy to go as far as A'^^'^"'^^ then this may be further reduced to I ^ 61og2 K. Here we say that 
a group is £-nilpotent if it admits a generating set ui, . . . ,ui such that [u^, Uj] € {uj+i, ■ ■ ■ , Ui) for all 
i < j. In particular such a group admits a normal series with cyclic factors of length at most £, and 
so is also nilpotent of step at most £. We refer the reader to Theorem 12.121 and to Section [10] for a 
detailed statement and proof. 

Remark 1.10. Note that no bound is provided on the size of the finite group H in Theorem 11.61 other 
than that it is finite. Indeed, by considering A to be a large finite simple group it is not difficult to see 
that H can be arbitrarily large. 

We will in fact prove a much more precise version of Theorem 11.61 involving a slightly complicated 
type of approximate group which we call a coset nilprogression. We discuss this concept in some detail 
in the next section. For many applications, however, Theorem ll.6l is quite sufficient. 

Applications. We now give a small selection of applications to growth in groups and to Riemannian 
geometry; a greater variety is assembled in mi[ which also contains proofs of these statements. 

Polynomial growth conditions and Gromov's theorem. Firstly, Theorem 11.61 vields a quick proof of 
Gromov's theorem |26i on groups of polynomial growth. 

Theorem 1.11 (Gromov's theorem). Let G he a group of polynomial growth. That is, G is generated 
by a finite symmetric set S, and there are constants C and d such that \S^\ ^ Gn'^ for all n ^ N. Then 
G is virtually nilpotent. 

Remark 1.12. In fact, our arguments show that there is some function / : N — > N, f{n) — > oo, such 
that, if G does not have polynomial growth, then ^ n-'^'^"^ for all n. We do not get an explicit 
function /. However, if the control parameter Ox(l) in Theorem 12.101 were known to be polynomial in 
K, we could take f{n) = clogn. The best (in fact only) lower bound known for this function at present 
is (log log n)^, due to Shalom and the third author [SD]. It is conjectured by some, in the absence of 
any examples to the contrary, that f{n) > n"^, and possibly even that jS"*! ^ e'^^. 

In |32j Hrushovski also gave a derivation of Gromov's theorem from his Lie model theorem (see 
Theorem 13. 101 below). He in fact proved a strengthening of Gromov's theorem (see [32l Theorem 7.1] 
or Theorem lll.ll below'). We will be able to recover Hrushovski's result more directly (see Corollarv lll.2l 
below). In fact, our approach can also yield the following other strengthening of Gromov's theorem, 
which is uniform in the size of the generating set S and appears to be new. Recall that if ^ G N then we 
say that a group is ^-nilpotent if it admits a generating set ui, . . . , M£ such that [ui, Uj] G (uj+i, . . . , ui) 
for all i < j. 
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Theorem 1.13. Let d > 0. Then there is hq — no{d) > such that if G is a group generated by 
a finite symmetric set S with 1 E S for which \S"\ ^ n'^\S\ for some n ^ nQ{d), then G is virtually 
nilpotent. In fact G has a normal subgroup of index at most Od{i) which is finite-by-{0{d) -nilpotent) . 

Proof. The proof of this (and hence of Theorem ll.lip is a short enough deduction that we can give 
it here in the introduction. We refer the reader to Section [11] for more details. Let N — N{d) be a 
large quantity to be specified later, and let hq be sufficiently large depending on N and d. By the 
pigeonhole principle and the hypothesis jS*"! ^ n'^\S\ we see that if uq is sufficiently large depending 
on N then there exists n' , N n' rio/lOO, such that \S^°°'''\ (200)''|S'"' |. By Corollary O (which 
is quite easy) this implies that 5^" is a e'^^'^^-approximate group. By our main theorem, Theorem 11.61 
(and Remark II. 9p . we can thus find a finite- by- (0((i)-nilpotent) and hence virtually nilpotent group 
Go such that 5^" is covered by Od(l) left-translates of Gq. By the pigeonhole principle, if N is large 
enough, we can find a nonnegative m < 2n' such that S'^^^Gq — S^^Gq. Multiplying on the left by S 
repeatedly we conclude that S^+'^Gq = S"^Gq for all k ^ 0. Since S generates G, we conclude that 
G = S""Go = 5*2"' Go. Since 5^"' was covered by 0^(1) left-translates of G, Go has index 0^(1) in G, 
and so G is also virtually nilpotent. □ 

Riemannian manifolds. A. Petrunin suggested to us some years agcQ that a result such as Corollary 
11.131 would give a purely group-theoretical proof of a theorem of Fukaya and Yamaguchi [15\ according 
to which fundamental groups of almost non-negatively curved manifolds are virtually nilpotent. Recall 
that a closed manifold M is said to be almost non-negatively curved if one can find a sequence g„ 
of Riemannian metrics on it for which diam(M, f;„) ^ 1 while KM,g„ 1^ —1/n where KM.g„ is the 
sectional curvature. Indeed, a simple application of the Bishop-Gromov inequalities combined with 
Corollary 111.51 yields the following improvement assuming only a lower bound on the Ricci curvature 
and an upper bound on the diameter. 

Corollary 1.14 (Ricci gap). Given d € N, there is e{d) > such that the following holds. Let 
M = (Af, g) be an d-dimensional compact Riemannian manifold with Ricci curvature bounded below by 
—e and diameter diam(Af) ^ 1. Then 7r(M) is virtually nilpotent. 

This result is known to differential geometers and follows from the works of Cheeger-Colding [10] 
and Kapovitch and Wilking [35 . We refer the reader to Section Hi. II for more discussion and references 
concerning the above result. We only note that Corollary [TL5] yields in fact an explicit bound on the 
nilpotency class, namely that after passing to a subgroup of 7ri(A/) with index Od(l) and quotienting 
by a finite normal subgroup, we obtain a 0((i)-nilpotent group. 

Generalised Margulis lemma. Another corollary of Theorem 11.61 is a "generalised Margulis lemma" 
for metric spaces of a type conjectured by Gromov in [27l §5.F]. A metric space X is said to have 
bounded packing with packing constant K if there is if > such that every ball of radius 4 in X can 
be covered by at most K balls of radius 1. Say that a subgroup F of isometrics of X acts discretely on 
X if every orbit is discrete in the sense that {7eF:7-a;GS}is finite for every x £ X and for every 
bounded set E C X. 

^See also jhttp : //mathoverflow.net/questlons/1 1091 j 
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Corollary 1.15 (Generalized Margulis Lemma). Let K ^ I be a parameter. Then there is some 
e{K) > such that the following is true. Suppose that X is a metric space with packing constant K , 
and that T is a subgroup of isometrics of X which acts discretely. Then for every x €z X the "almost 
stabiliser" T^{x) = (5^(2;)), where S^{x) := {7 G F : d{'^ ■ x,x) < e}, is virtually nilpotent. 

Note that the space X is not assumed to be a manifold. The traditional Margulis lemma estbalishes a 
similar statement for subgroups of isometries of pinched negatively curved manifolds, or more generally 
under a curvature lower bound. 

Approximate groups and polynomial growth. Finally we remark on an additive-combinatorial appli- 
cation, which asserts that approximate groups have large subsets with "polynomial growth" . 

Theorem 1.16 (Approximate groups are locally of polynomial growth). Suppose that A is a global K- 
approximate group. Then A'^ contains a Oxii)- approximate group A' with (A')'* C A'^ and \A'\ \A\ 
such that |(v4')"| <k mO^(i)|A'| for all m ^ 1. 

This theorem is an immediate consequence of Theorem 12.101 and Proposition IC.5I below. 

Remark 1.17. The above argument converted nilpotent structure (or more precisely, coset nilprogression 
structure, see below) to polynomial growth. In the reverse direction, there is the result of Sanders [45] 
in certain monomial groups, in which polynomial growth is shown to imply a metric ball type structure, 
at least under the (rather strong) restriction that the approximate group A is normal in the ambient 
group G. 
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2. COSET NILPROGRESSIONS AND A MORE DETAILED VERSION OF THE MAIN THEOREM 

This section concerns the more precise variants of our main theorem, whose existence we hinted 
at in the first introductory section. Let us first recall the fundamental inverse sumset theorem for 
abelian approximate groups. This was first introduced by Freiman [14], and a simplified argument was 
subsequently given in the paper [43] of Ruzsa. Here is the theorem in the torsion-free setting. Recall 
the notion of a generalised arithmetic progression, defined in Example [3] above. 
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Theorem 2.1 (Freiman-Ruzsa theorem). Let G — (G, +) he a torsion-free {global) abelian group, and 
let K ^ 2 be a parameter. Suppose that A C_ G is a K -approximate group. Then AA = A-\-A-\-A-\-A 
contains a generalised arithmetic progression 

P{ui,...,Ur;Ni,...,Nr) 

with r ^ log'^'-^-' K and \P\ 3> 6^'°^'^' ' ^1^1- ^'^ particular A can be covered by 0(e'°s' ' ' ^) translates 
of P. 

Proof. See [l^ for the main part of this; the final assertion is then a consequence of Ruzsa's covering 
lemma, Lemma [01 For earlier results of this type with weaker bounds on r and P, see [51 US]- In [2S] 
it was noted that one can take r as small as [log2 K + e\ for any e > 0, at the cost of decreasing the 
size of \P\ somewhat; see also [3l|4] for prior results along these lines. □ 

Roughly speaking, Theorem 12.11 asserts that, in a global torsion-free abelian group such as the 
integers Z, approximate groups are "controlled" by generalised arithmetic progressions of bounded 
rank. In the case of abelian groups with torsion, the class of generalised arithmetic progressions is not 
sufficient, as one must also now deal with the example of finite genuine groups (Example [1]) . It is thus 
natural to introduce the concept of a coset progression H -\- P: the sum of a finite genuine group H 
and a generalised arithmetic progression P = P{ui, . . . , w^; A^i, . . . , Nj.). This concept is sufficient for 
the formulation of a Freiman type theorem in an arbitrary abelian group. 

Theorem 2.2 (Abelian Freiman-Ruzsa theorem). Let G — (G, +) be a (global) abelian group, and let 
K ^ 2 be a parameter. Suppose that A Q G is a K -approximate group. Then AA contains a coset 
progression H + P, where 

P ^ P{ui,...,Ur;Ni,...,Nr) 

is a generalised arithmetic progression with r ^ log°(^^ K, H is a finite abelian subgroup disjoint from 
P, and \H + P\ = \H\\P\ > e"i°s''<'' particular, A can be covered by 0(ei°s°'" ^) translates 

ofH + P. 

Proof. Again, see [l^; see also [24] for an earlier result in this direction. □ 

We turn now to the business of dropping the commutativity assumption. We will also drop the 
assumption that A is contained in a global group and merely assume that A is a subset of a local group 
G. Informally, this means that we will not require the multiplication law to be defined everywhere in 
G, but only in a certain neighborhood of id. We refer the reader to Appendix [B] for a precise definition 
and basic properties; see also [IHl IV. 3] for a discussion of the closely related notion of group chunk. 
We generalise the concept of a generalised arithmetic progression to this setting as follows. 

Definition 2.3 (Non-commutative progression). Let ui,...,Ur be r elements in a local group G = 
(G, •), and let A^i, . . . ,Nr be r positive real numbers. If all products gi . . . gn are well-defined in G, 
where each gi is equal to one of Uj or uj^ and, for each j = 1, . . . , r, the formal expressiorQ Uj and 
its inverse uj^ appear at most Nj times, then we call the set of such products a non-commutative 

^For this definition, we consider Ui and Uj to be distinct formal expressions when i j, even if and Uj take the same 
value in G, and similarly for , uj^ . Thus, for instance, P{u\, U2; 1, 1) contains uiU2 even if ui, U2 are equal. 
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progression of rank r and side lengths Ni, . . . , Nr and we denote it by P{ui, . . . , Ur] Ni, . . . , Nr). We 
refer to r as the rank of the non-commutative progression. 

Remark 2.4. One can view non-commutative progressions as multiparameter variants of balls in a 
word metric. For instance when all Nj take the same value N and one is working in a global group, 
the progression . . . , Ur', N, . . . , N) is comparable with the word ball B{N) of radius N in the 

group {ui,...Ur) for the word metric with generating set {mi,...,^^} in the sense that B{N) C 
Piui,...,Ur;N,...,N)CB{rN). 

In the global abelian setting, all generalised arithmetic progressions of bounded rank are automat- 
ically approximate groups with a bounded covering parameter K. This is not the case in general 
non-abelian groups, even in the global setting. For instance, if F is the free non-abelian group on 
two generators 61,62, then the non-commutative progression P(ei, 62] N, N) (which, as remarked ear- 
lier, is essentially the ball of radius N in F) grows exponentially in N, and one can easily verify 
that P(ei,e2', N, N) is only a if -approximate group for K growing exponentially in N. However, 
the situation is much closer to the abelian case if the ambient group G is nilpotent. Given the link 
between progressions and balls, the reader familiar with Gromov's theorem on groups of polynomial 
growth [26] (to be discussed later on) will not find this surprising. Indeed, it can be shown (though 
we will not do so here) that if G is a global nilpotent group of step s, a non-commutative progression 
P{ui, . . . ,Ur; Ni, . . . , Nr) in G will be a Or,s(l)-approximate group if A^i, . . . , A'^r are sufficiently large 
depending on r and s. 

This motivates the following definition. Given some generators mi, . . . , u^, let us recursively define 
an iterated commutator of degree k involving these generators for a natural number fc ^ 1 by declaring 
uf^,...,uf^ to be the iterated commutators of degree 1, and [g,h] to be a iterated commutator of 
degree j + k whenever g, h are iterated commutators of weight j, k respectively for some j, fc ^ 1. Thus 
for instance [[u2, "3^^], [w^^, 1*4]] is an iterated commutator of ui, U2, u^, U4 of degree 4. 

Definition 2.5 (Nilprogression) . Suppose that G is a local group and that s ^ is an integer. A 
nilprogression of rank r and s is a non-commutative progression P{ui, . . . , Ur] Ni, . . . , Nr) with the 
property that every iterated commutator of degree s -I- 1 in the generators ui, . . . ,Ur is well-defined 
and equals the identity id. 

Example 9. The generalised arithmetic progressions P{ui, . . . ,Ur] Ni, . . . , Nr) in Example [3] is a nil- 
progression (in additive notation) of rank r and step 1. The set P{ui,U2', Ni, N2) in Example [5] is a 
nilprogression of rank 2 and step 2. 

It can be shown (though we shall not do so here) that ii Ni, . . . , Nr are sufficiently large depending 
on r, s, and P{ui, . . . , Ur] GNi, . . . , CNr) is a well-defined nilprogression of step s for some sufficiently 
large G depending on r, s, then P(ui, . . . , Ur] Ni, . . . , Nr) is a Or,s (l)-approximate group. 

The concept of a nilprogression as defined above is related to, though not quite identical with, the 
one given in [5]. As a byproduct of our proof methods, we will be able to work with a more tractable 
subclass of nilprogressions, which we will call nilprogressions in C -normal form. These generalise the 
notion of a proper generalised arithmetic progression in the additive combinatorics literature, and are 
also close in spirit to the nilprogressions introduced in [52 . 
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Definition 2.6 (C-normal form). Let C ^ 1. A noncommutative progression 

P{ui,...,Ur;Ni,...,Nr) 

is said to be in C-normal form if the following axioms are obeyed. 

(i) (Upper-triangular form) For every i,j with 1 ^ i < j ^ r and for all four choices of signs ± 
one has 

In particular, [ui, Ur] = id whenever 1 ^ i < r. 

(ii) (Local properness) The expressions u"^ . . . u"'' are distinct as rii, . . . ,71^ range over integers 
with \ni\ iiV;, i = l,...,r. 

(iii) (Volume bound) One has 

(2.2) ^(2LA^iJ + 1) . . . (2LiV,J + 1) ^ |F| < C(2LiViJ + 1) . . . (2LA^,J + 1). 

The somewhat ugly expression (2[iViJ + 1) . . . (2[iVrJ + 1) is convenient to have in (12.21) for some 
minor technical reasons, but it would not do much harm for the reader to mentally substitute Ni . . . Nr 
for this expression instead if desired. The volume bound (12.21) is morally (up to some degradation 
in the constants C) implied by the other axioms of a nilprogression in C-normal form, when the 
Ni, . . . , Nr are sufficiently large, and one is working in a global group (or at least if one assumes 
P{ui, . . . , Ur; DNi, . . . , DNr) to be well-defined for some sufficiently large D = Dr.s), but for some 
further minor technical reasons it is convenient to state this bound explicitly in the definition. 

Example 10. The generalised arithmetic progressions P{ui, . . . ,Ur ■,Ni,...,Nr) in Example [3] will be 
in 1-normal form if it is proper, i.e. if all the expressions niui -I- . . . -f rirUr for \ni\ ^ Ni are distinct. 

Example 11. The set P(ui,U2] Ni, N2) in Example [S] is not in C-normal form for any bounded C, 
because [ui,U2] is non-trivial. However, the closely related nilprogression 

P{ui,u2,[ui,U2];Ni,N2,NiN2) 

of rank 3 and step 2 is in 1-normal form. The two sets are "comparable" in a number of ways; for 
instance, one can easily verify that 

P{ui,u2;^Ni,^N2) C P{ui,u2,[ui,U2];Ni,N2,NiN2) C P(ui, ua; CA^i, CA^a) 
for some absolute constant C (e.g. one can take C = 100). 

Remark 2.7. Note that in the global group case, the step of a nilprogression in C-normal form is less 
or equal to its rank. 

In Lemma IC.ll we will show that any non-commutative progression P{ui, . . . ,Ur', iVi, . . . , Nr) in 
C-normal form is "essentially" a Or,c(l)-approximate group. More precisely, we will show that 
P(ui, . . . ^Ur'jSNi, . . . ^eNr) is a Or.c.£(l)-£^PProximate group whenever e > is sufficiently small 
and the Ni's are sufficiently large depending on C, r . We will also show that every element of 
P(ui, . . . ^Ur'jSNi, . . . ^eNr) can be rewritten in the form where h £ H and \ni\ = 
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Or,s{£Ni), while conversely every such product with \ni\ ^ eNi obviously belongs to P{ui, . . . , u^; sNi, 
...,eNr). 

Just as in the abelian case, we need to account for genuine subgroups. The analogue of coset 
progression is a coset nilprogression, a concept we first define in the simpler setting of global groups. 

Definition 2.8 (Global coset nilprogression). Let G be a (global) group. By a coset nilprogression of 
rank r and step s in G, we mean a set P of the form tt^^{Q), where Go is a subgroup of G, H is a 
finite normal subgroup of Go, tt : Gq — > Go/i? is the quotient map, and Q is a nilprogression of rank r 
and step s in Gq/H. 

We say that P is in C -normal form if Q is in G- normal form. 

We can extend this definition to local groups, using the local notion of quotient group reviewed in 
Lemma lB.121 

Definition 2.9 ((Local) coset nilprogression). Let G be a (local) group, which we endow with the 
discrete topology. By a coset nilprogression of rank r and step s in G, we mean a set P of the form 
Tr~^{Q), where if is a finite genuine subgroup of G with a cancellative normalising neighbourhood Go, 

is a neighbourhood of H in Go with C Gq, WH = HW ^ W , -k : W W/H is the quotient 
map defined in Lemma IB. 121 and Q is a nilprogression of rank r and step s in W/H. 

We say that P is in G -normal form if Q is in G- normal form. 

We call H the finite group associated with P, and Q the nilprogression associated with P. If 
Q = P{ui, . . . , u,.; iVi, . . . , Nr), then we write P = Ph{ui, . . . ,Ur]Ni, . . . ,Nr). 

Example 12. A subgroup is a coset nilprogression of rank and step 0. More generally, the direct 
product of a subgroup with a nilprogression of rank r and step s is a coset nilprogression of rank r and 
step s. The coset nilprogression will be in G-normal form if the associated nilprogression is. 

Example 13. The set A constructed in Example [5] is a coset nilprogression of rank 1 and step 1, and is 
also in 1-normal form as long as iV < ^y^. 

Again, coset nilprogressions in normal form are essentially approximate groups; see Lemma lC.ll for 
a precise version of this statement. 

We are now ready to state our main technical theorem, which among other things implies Theorem 
11.61 and whose proof will occupy the bulk of this paper. 

Theorem 2.10 (Main theorem). Let A he a K -approximate group. Then A* contains a coset nilpro- 
gression P of rank and step Oxi^) and \P\ 3>a' |^|- Furthermore, P can be taken to be in OKi^)-normal 
form. 

We remark that precursor results to this theorem in the case of nilpotent or solvable groups were 
obtained in [S] |6l [131 IIZl HH [52] . Theorem 12.101 also provides an independent proof of a qualitative 
version of the abelian results of Theorem 12.11 and Theorem 12.21 which, in contrast to the other known 
proofs of these results, manages to almost completely avoid the use of Fourier analysii^. 



However, our argument still uses results relating to Hilbert's fifth problem which require Fourier-analytic tools, such as 
Pontryagin duality, even in the abelian setting. 
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It is easy to see that Theorem 12.101 imphes Theorem 11.61 by taking Go to be the global group 
generated by P. The key point here is that a group generated by a set ui, ... ,Ur is nilpotent of step at 
most s if every iterated commutator of the mi, . . . , u,- of degree s + 1 is trivial. A proof of this assertion 
may be found in Hall's book |28) . 

By standard non-commutative product estimates, we can also establish the following Freiman-type 
theorem for sets of bounded doubling. 

Corollary 2.11 (Freiman-type theorem). Let K ^ 1. Let G be a (global) group and A, B be finite 
non-empty subsets of G such that \AB\ ^ K\A\^/'^\B\^^'^ . Then there exists a coset nilprogression P of 
rank and step 0/f(l) with \P\ 3>a' |^| which is in Oxi^) -normal form, such that A can be covered by 
Oi<-(l) left-translates of P, and B can be covered by 0/^(l) right-translates of P. 

Proof. This follows immediately from combining Theorem 12.101 with [511 Theorem 4.6]. □ 

In Section [TOl we will show the following explicit bounds on the rank and step of P. 

Theorem 2.12 (Bounds on the rank and step of the nilprogression). In Corollarv \2. and in Theorem 
\2.10\ if A is assumed to be a global K-approximate group), at the expense of replacing the conclusion 
P C with the weaker statement that P C A^^ , the coset nilprogression P can be taken to have rank 
and step at most 0{K^ logK) while remaining in OK{i)-normal form. Moreover, if we settle for the 
weaker inclusion P C A^'^'^^\ one can ensure that P has rank and step at most 61og2-ft' (while still 
remaining in O K(i-) -normal form) . 

It is likely that the numerical constants 6 and 12 here can be improved, but we will not pursue such 
improvements here. 

Local approximate groups can be embedded in global groups. As we have remarked above, the 
approximate groups A considered in this paper are local in the sense that we do not need to assume 
that A lies in a global group G. However as a consequence of Theorem l2.10[ the more detailed version 
of our main theorem, we have the following statement. It asserts that, at least at the qualitative level, 
there is in fact no loss of generality in dealing with the global case. 

Theorem 2.13. Suppose that A is a K-approximate group. Then A'^ contains a Ok (i)- approximate 
group A' with (A')'^ C A'^ and \A'\ \A\ which is isomorphic to a subset of a global group G. 

This theorem follows from Theorem 12 . 101 and the fact (which we prove in Lemma FC.Sp that a large 
portion of a coset nilprogression in normal form can be embedded in a global group. This theorem can be 
viewed as a discrete analogue to a recent result of Goldbring and van den Dries [SS], who established 
that every locally compact local group is locally isomorphic near the identity to a locally compact 
global group (thus there is a neighbourhood of the identity in the former group that is isomorphic to 
a neighbourhood of the identity in the latter group). One should also compare this result with Lie's 
third theorem that every local Lie group is locally isomorphic to a global Lie group (see Theorem lB.161 
and the discussion in Serre's book [49]). 
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3. Ultra approximate groups and Hrushovski's Lie Model Theorem 

In the next section we will give an outline of the argument we shall use to prove Theorem 12.101 
An extremely important component of it will be a Lie Model Theorem that implicitly appears in a 
remarkable paper of Hrushovski [321 Theorem 4.2] , which provided the foundation for much of the work 
here, and for which we will give a self-contained proof later in this paper. We can state this theorem 
very informally as follows: 

Theorem 3.1 (Hrushovski's Lie Model Theorem, informal version). In a suitable limit, an approximate 
group is virtually modelled by a precompact neighbourhood of the identity in a Lie group. 

Of course, to make this theorem more precise, one has to formalise terms such as "suitable limit" , 
"virtually" , and "modelled" . We shall do so presently, but first we point out that Theorem 13.11 is 
very similar in spirit to a key step [261 §7] in Gromov's proof of his celebrated theorem on groups of 
polynomial growth, which we state informally as follows. 

Theorem 3.2 (Gromov's Lie Model Theorem, informal version). In a suitable limit, a group oj poly- 
nomial growth can be modeled by a finite- dimensional locally compact space with a transitive isometric 
action of a Lie group. 

To deepen the analogy between the two results, we note that Theorem 13. II and Theorem 13.21 both 
require the deep body of results surrounding the solution to Hilbert's fifth problem on the topological 
description of the category of Lie groups (see [35]) in order to bring into view the Lie structure, which 
is not manifestly present when one first takes a limit. There are however some technical differences 
between the precise formulations of Theorem 13.11 and Theorem 13.21 In the latter theorem, one has a 
group G (of polynomial growth) generated by a finite set S. This gives a metric on G, the word metric 
given by the generating set S. Gromov then looks at the discrete balls S"", n — 1,2,3... "from a 
distance" to get some continuous limit metric space X. For example if G = Z and 5 = {—1,0,1}, then 
5" = {— n, . . . ,n}, and it is heuristically clear that these discrete intervals 5", after rescaling by n, 
"converge" in a suitable sense to the continuous interval [—1,1] C R. 

To effect this limit, Gromov introduced what is now known as Gromov-Hausdorff convergence of 
a sequence of metric spaces. In subsequent work of van der Dries and Wilkie [57] a slightly different 
approach, using ultralimits (or non-standard analysis) was pioneered. This construction is now known, 
in the geometric group theory literature, as the asymptotic cone. 

The asymptotic cone, then, is (a quotient of) an ultraproduct of the sequence of balls (S'")„gN- We 
will use a similar limii0 in order to formalise Theorem 13.11 namely an ultraproduct A of an arbitrary 
sequence (An)neN of i^-approximate groups, an object we call an ultra approximate group. We now 
define this term more precisely. 

Definition 3.3 (Ultra approximate group). Throughout this paper, we fix a non-principal ultrafiltcr 
a £ f3N\N (see Lemma [A. II for a definition of this concept). If ii' > is a real number then an ultra K- 
approximate group is an ultraproduct A := rin->.Q^ni where each An is a (standard) ii'-approximate 

^In 1321 . more saturated limits (not necessarily constructed using ultrafilters) were also considered, but we will not need 
such constructions here. 
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group. Thus, A is the space of ah formal hmits hnin->.Q an with On G ^n, where two formal limits 
limn-i-Q fln and limn_j.Q, a'„ are considered equal if an — a'„ for all n sufficiently close to a (i.e. for all n 
in an a-large subset of N). See Appendix \X\ for more discussion on ultraproducts. Often we will not 
need to refer to K explicitly, in which case we speak simply of an ultra approximate group. 

Note that we allow the approximate groups to lie in different ambient groups Gn (much as the 
notion of Gromov-Hausdorff convergence also does not require the spaces Xn involved to all live in a 
common ambient space). Ultraproducts are a model-theoretic limit, in contrast to the more geometric 
notion of a limit defined by Gromov-Hausdorff convergence. There are two key properties of these 
model-theoretic limits that make them convenient to use for our purposes. The first is Los's theorem^ 
which roughly speaking asserts that any property that can be stated in the language of first-order logic 
holds for an ultraproduct A ~ Hn-s-a ^'^'^ only if it holds for those An with n sufficiently close to 
a; see Theorem lA.6l The second is countable saturation, which we will use to establish the completeness 
of a certain (pseudo)metric space associated to an ultra approximate group; see Proposition 16. II 

Next, we discuss what it would mean tcl§ "model" an ultra approximate group A. Informally, a 
model would seek to describe the "coarse-scale" behaviour of A, and in particular be able to predict 
when an orbit id, a, a^, a'^, . . . of an element a of A will "escape" A, while ignoring the "fine-scale" 
behaviour of A. Such a model will be formalised by a homomorphism : A^ L of local groups 
that obey certain good properties (see Definition 13.51 below). Before we present this formal definition, 
though, we first discuss some key examples of ultra approximate groups and their models. 

Example 14 (Nonstandard finite groups). Suppose that An is a sequence of (standard) finite groups; 
then the ultraproduct A Tln-i-Q! ^'^ ultra approximate group. In this case, A is in fact a 
genuine group, with group operation given by the law 

(lim On) ■ (lim 6„) := lim(a„6„). 

n— fa n—^a n— fa 

We will refer to such groups as nonstandard finite groups. A typical example of a nonstandard finite 
group is the nonstandard cyclic group@ 

l/NI := Jl Z/nZ, 

n— >a 

where A'^ <E *N is the nonstandard natural number 

(3.1) N := lim n. 

n— >a 

In a nonstandard finite group A, there are no elements that ever escape A: if a G A, then one has 
a" e A for all n E N. As such, it will turn out that A can be modeled by a trivial homomorphism 
: A — > {id} to the trivial group. 

use of the term "model" here is not, strictly speaking, the precise notion that is used in model theory, but is closer 
to the notion of a "Freiman model" from additive combinatorics, as used for instance in I43| . |24| . 

'^This group is the analogue of the profinite completion Z = lim<_ Z/nZ of the integers, but is built using the machinery of 
ultralimits rather than inverse limits. The two groups are however not identical. For instance, Z is torsion-free, whereas 
li/NZ can contain torsion; for example if A'^ is even, or equivalently if the set of even natural numbers is o-large, then 
Z/7VZ contains the element N/2 mod A^, which has order 2. But see Remark l3.4l below for a link between ultraproducts 
and inverse limits. 
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Example 15 (Nonstandard intervals). Now consider the sequence An '■— P{l;n) = {— n, ...,n} of 
(standard) arithmetic progressions in Zi. The ultraproduct A : — rin— t)e viewed as the 
nonstandard arithmetic progression A = P(l; N) = {—N, . . . , N} in the nonstandard integers *Z 
J^^_j_^ Z, where N was defined in (j3.ip . Then A is an ultra approximate group, and it can also be 
viewed as a local group inside the nonstandard integers *Z. 
Consider now the map tt : A — > R defined by 

7r( lim On) := st lim — , 

n— fa n— n 

where stx is the standard part of a nonstandard real x (see Appendix IX)) . Thus, for every standard 
£ > 0, one has 

7r( lim On) — e ^ — ^ 7r( lim «„) + e 

n— >-a U n— fa 

for all n sufhciently close to a. One may also write 

, , a 
7i"(a) = St — 

for all a G A. The map tt is a homomorphism of local groups from A into [—1,1]. It is surjective since, 
for any 7 G [—1, 1], the nonstandard integer 

X :— [■^Nl = lim [7nJ , 

n— f 00 

where [J is the integer part function, has image 7r(a;) = 7. The kernel ker(7r) is the set of a; G A with 
X ~ o{N) (thus if a; = linin-s. a Xix and £ > is standard, then \xn\ ^ £ii an Q-large set of u). For instance, 
every standard integer lies in ker(7r), as do some non-standard integers such as [\/^J = linin-foo Lv^J • 

There are similar maps frono A™ to [~m, m] for any fixed natural number m, which by abuse of 
notation we also call tt. Informally, these maps model A by the interval [—1,1], and more generally 
model A™ by [—to, to]. In this particular case, the model tt : A — )■ [—1, 1] of the ultraproduct A can 
be viewed as a limiting object for models TTn : An [—1, 1] of the individual factors An, by defining 
7Tn{a) ^. However, in more general situations, the model for the ultraproduct is only a limit for 
approximate models of the factors, and this is one reason why we need to work in the ultraproduct 
setting as much as we do. 

The model tt : A™ — ^ [—to, to] is not injective: if 7r(a) is trivial, this does not imply that a is 
trivial. However, tt does have an injectivity-like property which will be important later, which roughly 
speaking asserts that if 7r(a) is small, then a is small. For instance, observe that if a G A^™" is such 
thalQ 7r(a) G (—1, 1), then a G A. This property on the model tt can be used to derive some important 
facts about the ultraproduct A; for instance it implies the escape property that if a, a^, ... , a^™ all lie 
in A^*^, then a lies in A. These sorts of escape properties will play a major role in our arguments in 
later sections. 



Strictly speaking, as we are currently in an additive setting, one should write mA = A + . . . + A rather than 
A'" = A ■ . . . ■ A here. 

"'^"'^This claim is not quite true when 7r(a) is —1 or +1, as can be seen for instance by considering a = N + 1 = limn-+a n+1. 
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Example 16 (Generalised arithmetic progression). We still work in the integers Z, but now take An to 
be the rank two generalised arithmetic progression 

An P(l, n^O; n, n) := {a + bn^° : a,b e {-n, . . . , n}}. 

Then the ultraproduct A := Hn-i-a ^^'^ subset of the nonstandard integers *Z of the form 

A = P(l, 7Vi°; iV, N) = {a + bN^° ■.a,be {-N, N}}. 

This is an ultra approximate group which can be modeled by the Euclidean plane ]R^, using the model 
maps TT : A™ defined for each standard m by the formula 

7r(a + feiViO) fst-,st-^ 

whenever a.b ~ 0{N). The image 7r(A™) is then the square [—in, m]^. As before, if a G A^*"^" is such 
that 7r(a) e (—1, 1)^, then a G A; this can be used to conclude that if a, a^, ... , a^™ G A^°, then a e A. 
Note here that while A lives in a "one-dimensional" group *Z, the model is "two-dimensional". This 
is also reflected in the volume growth of the powers A™ of An for small m and large n, which grow 
quadratically rather than linearly in m. 



Example 17 (Heisenberg box, I). This example is related to the Heisenberg example in Example |6l We 
take each An to be the "nilbox" 

(3.2) ^= {(TJ ? ) ^ J ?) ^ ^ ^ ■ 

This is not quite an approximate group because it is not quite symmetric (cf. Example [5]), but we will 
ignore this technicality for sake of exposition. In any case it can be repaired in a number of ways, for 
instance by replacing An with An U An^ ■ Once again we consider the ultraproduct A :— rin-i-Q 
this is a subset of the nilpotent (nonstandard) group yo i *z j, consisting of all elements yo i y j with 
|a;|, \y\ ^ TV and \z\ ^ iV^; again, this is a (discrete) local group. 
Consider now the map 

A 8 / 1 R R\ 
TT : A -> 1 R 

V 1 / 

defined by 



/ / 1 X z 

3.3) TT 1 y 

Wool 







st 

St jv 




))- 




1 






\0 








This is easily seen to be a homomorphism (of local groups) to the Heisenberg group, whose image is 
the compact set 

(3^4) {(Jfj)c(J!|):N,|„|,|=Kl}. 

Informally, it models A (or A^) by what is essentially a unit ball in this Lie group. As before, we have 
the injectivity-like property that if a e A^'^™ is such that 7r(a) is sufficiently close to the identity, then 
a S A; as such, one can again establish the escape property that if a, a^, ... , a^^^ all lie in A^", then a 
lies in A. 
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Example 18 (Heisenberg box, II). This is a variant of the preceding example, in which the (not quite) 
approximate groups now take the form 

(3.5) ^"-^KisT?) H2;nMynKn,|zJ <nio} 

so that the ultrahmit A := Iln^a takes the form 

{(s ; 9 ^(sT-D^w.M^iv.N 

Now consider the map 

TT : 

defined by 

irffoiy]] = fst — ,St — ,St^r7-) . 

The image of this map is the unit cube [—1, 1]'^, and is in particular compact. It is also a homomorphism 
of local groups, since 

but the nonstandard real xy' /N^'^ = 0{N'^ /N^^) is infinitesimal, and so the previous expression is 
equal to 

( x + x' y + y' z + z'\ 

which establishes the homomorphic nature of tt. 

Here we note that the homomorphism tt : A^ — > R'^ is not associated to any exact homomorphisms 
TTn from Afj to M!^. Instead, it is only associated to approximate homomorphisms 

//I x„ zn \\ fXn Vn Zn \ 

Wool// Vn n n^^' J 
into . Such approximate homomorphisms are somewhat less pleasant to work with than genuine 
homomorphisms; one of the main reasons why we work in the ultraproduct setting is so that we can 
use genuine group homomorphisms, or at least local group homomorphisms, throughout the paper. 

Note that the preceding example p.2p admits a homomorphism tt onto the abelian group M? by 
composing the map (|3.3p with the natural map from ( o i ^ ) to its abelianisation M? . However the 

kernel of tt is, for us, too "big" . In particular it contains every ( ^ i o ^ , and in particular contains 
elements of A* not in A. By contrast there are no such elements in the example p.5|) . In particular, 
we can still use the model tt to establish the same escape property for A as before, namely that 
whenever a, . . . , a^^" e A^*^, one has a e A. 

We also note the sets A™ for small m and large n grow cubically in m in this example, and quartically 
in m in the previous example. This is consistent with the model groups having homogeneous dimension 
3 in the current example and 4 in the previous example. 

In all the above examples, the model group L was a Lie group. We give now give some examples to 
show that the model need not initially be of Lie type, but can then be replaced with a Lie model after 
some modification. 
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Example 19 (Nonstandard cyclic group, revisited). The first example is the nonstandard cyclic group 
A Z/2^Z ~ Z/2"Z. This is a nonstandard finite group and can thus be modeled by the 

trivial group {id} as discussed in Example 1 141 However, it can also be modeled by the compact abelian 
group Z2 of 2-adic integers using the model tt : A — > Z2 defined by the formula 

7r(a) :— lim a(mod 2") 

n— voo 

where for each standard natural number n, a(mod 2") G {0, . . . , 2"^^} is the remainder of a modulo 
2" (this is well-defined in A) and the limit is in the 2-adic metric. Note that the image 7r(A) of A is 
the entire group Z2, and conversely the preimage of Z2 in A^ = A is trivially all of Z/2^Z; as such, 
one can quotient out Z2 in this model and recover the trivial model of A. 

Example 20 (Nonstandard abelian 2-torsion group) . In a similar spirit to the preceding example, the 
nonstandard 2-torsion group A := (Z/2Z)^ = Iln^Ql^/^^)" can be modeled by the compact abelian 
group (Z/2Z)^ by the formula 

7r(a) := lim 7r„(a) 

n— fcjo 

where 7r„ : A ^ (Z/2Z)" is the obvious projection, and the limit is in the product topology of (Z/2Z)'^. 
As before, we can quotient out (Z/2Z)'^ and model A instead by the trivial group. 

Remark 3.4. The above two examples can be generalised to model any nonstandard finite group G = 
rin-s-a equipped with surjective homomorphisms from G^+i to G„ by the inverse limit of the G^- 

Example 21 (Lamplighter group). Let G be the lamplighter group Z k (Z/2Z)^, where Z acts on 
(Z/2Z)^ by the shift T : {Z/2Z f defined by T(a„)„gz := (an+i)nez- Thus the group law in G is given 

by 

{hx){j,y) := {i+j,x + T''y). 
For each n, we then set An C G to be the set 

An ■■= {{i, x) e G ; i e {-1, 0, +1}; x E (Z/2Z)"}, 

where we identify (Z/2Z)" with the space of elements (a„)„gz of (Z/2Z)^ such that a„ 7^ only for 
n S {1, . . . ,n}. These sets An are not quite approximate groups because they are not symmetric, but 
they are close enough to approximate groups for this discussion. For instance, they have bounded 
doubling or bounded tripling, and An U A~^ is an approximate group. We model the ultraproduct 
A := n,wa AnCZK *(Z/2Z)^ by the group 

G x^G -.^ m,x),{j,y)) e G X G : I = j} 

using the map 

7r((j, lim(a^"))„ez)) := ((«, (lim a^"))„ez), (i, (lim a|,"^,J„gz))- 

n— fa n— >-a n— >a 

Roughly speaking, 7r(a) captures the behaviour of a at the two "ends" of (Z/2Z)^. The image 7r(A) 
of A under this model is then the compact neighbourhood of the identity 

^(A) = m, x), (*, y)) e G : t e {-1, 0, +1}, x e (Z/2Z)N, y E {Z/2Zf\^'} 
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where we embed (Z/2Z)N and (Z/2Z)^\f* as sub groups of (Z/2Z)^ in the usual manner. One can also 
compute the images 7r(A'") for larger values of m, although they are a bit more complicated. One can 
verify the escape property that if (7,5^, . . . ,g^'^'^ € 7r(A^°) for some g E G x G, then g £ 7i'(A); here it 
is essential that we use both of the two factors of G G, as the claim is false if we project tt to just 
one of the two factors G, or to the base group Z. So, in this case, one needs a moderately complicated 
(though still locally compact) group G x^G to properlj0 model A and its powers A™. However, if 
we pass to the large subset A' of A defined by A' := rin-fa ^ni where 

a; := {{i, x)eG:i = 0;xe (Z/2Z)"} 

then A' is now a nonstandard finite group (isomorphic to the group (Z/2Z)^ considered in Example 
1201 and can be modeled simply by the trivial group {id}. Thus we see that we can sometimes greatly 
simplify the modeling of an ultra approximate group by passing to a large ultra approximate subgroup. 

Let us formalise the properties enjoyed by the above examples in the following definition, which will 
play a key role in this paper. 

Definition 3.5 (Good models). Let A be an ultra approximate group. A good model for A is a 
symmetric local topological group L (see Definition IB. ip . together with a homomorphism tt : A^ — !■ L 
of local groups with the following properties: 

(i) (Thick image) There exists an open neighbourhood of the identity Uq in L such that tt^^{Uq) C 
A and Uo C 7r(A). In particular kerTr C A; 

(ii) (Compact image) 7r(A) is contained in a compact set. 

(iii) (Approximation by "internal" sets) Suppose that F C U ^ Uq, where F is compact and U 
is open. Then there is an ultraproduct A' = Hn^a of finite sets A'^ C An such that 
n~^{F) C A' C n-\U). 

We will often abuse notation and refer to just L or vr as the good model for A, rather than the pair 
(L,7r). 

Remark 3.6. Properties (i) and (ii) together imply that L is locally compact. We leave it to the reader 
to check that the examples given above have all of the properties of this definition. One can think 
of a good model as accurately describing the "coarse-scale" structure of the ultra approximate group 
A, without directly controlling the "fine-scale" structure. For instance, in the example (|3.5p which is 
"abelian at coarse scales" but "2-step nilpotent at fine scales", the model tt only detects the abelian 
structure and not the 2-step nilpotent structure. 

Remark 3.7. In (iii), if F and U are symmetric neighbourhoods of the identity, then A' can be chosen 
to be symmetric (since one can replace A' with A' n (A')^^). As L is locally compact, we may shrink 
U to be precompact; then can be covered by finitely many translates of F, and thus A' is then an 
ultra approximate group. 



This can also be seen from volume growth considerations: grows like 4"*, which is also the rate of volume growth 
of 7r(A) in G Xz G, whereas the volume growth in a single factor G would only grow like 2™, and the volume growth in 
Z is only linear in m. 
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Finally, we need to explain the adjective "virtually" in Theorem 13.11 In group theory, "virtually" 
means "after passing to a finite index subgroup". Note that a subgroup G' of a group G has finite 
index if and only if G can be covered by finitely many left-translates - or, equivalently, right-translates 
- of G". This motivates the following definition. 

Definition 3.8 (Large approximate subgroups). Let A, A' be ultra approximate groups. We say that 
A' is a large ultra approximate subgroup of A if one has (A')** C A**, and A can be covered by finitely 
many left-translates of A'. 

Remark 3.9. It would be more aesthetically pleasing to have A' C A instead of (A')^ C A"*, but we 
need the exponent 4 in the inclusion for some minor technical reasons. Note that the property of being 
a large ultra approximate subgroup is transitive. 

We are now in a position to state Hrushovski's Lie Model Theorem. 

Theorem 3.10 (Hrushovski Lie Model Theorem). Let A be an ultra approximate group. Then there 
is a large ultra approximate subgroup A' of A such that A' admits a local Lie group as a good model. 

We will prove this theorem in ^ As stated above, the basic idea of the proof is to first establish 
that A itself admits a locally compact local group as a good model. Here results of multiplicative 
combinatorics, and in particular a lemma of Sanders ^S] (see also [12])) are critical. Once this is done, 
Theorem l3 . 101 follows relatively quickly from the deep results in the literature on Hilbert's fifth problem. 
This theorem will then play a key role in the proof of Theorem 12.101 in two ways: firstly by allowing 
us to establish certain "escape" properties on (ultra) approximate groups that will be used to build 
useful metric structures on these groups; and secondly by giving a natural notion of the "dimension" 
of an (ultra) approximate group which we will need to induct on. Note that one can invoke Lie's 
third theorem (Theorem lB.16P to upgrade the local Lie group in Theorem l3.10l to a connected, simply 
connected, global Lie group, but for technical inductive reasons it will be more convenient to keep the 
model in the category of local Lie groups for now. 

Theorem 13 . 1 01 will be proven in SJS] We will also establish a "global" variant of this theorem later, 
first in a weak form as Proposition 16.121 and then in a stronger form as Theorem 110.101 

4. An outline of the argument 

In the previous section we introduced the notion of a (Hrushovski) Lie model, one of the key technical 
tools we will use to prove Theorem 12.101 In this section we outline the argument for this proof as a 
whole. 

Our aim is to show that every i^T-approximate group is controlled in some sense by a coset nilpro- 
gression of rank and density Ok {I). We shall prove this by contradiction, assuming that there is a 
sequence (j4n)nGN of i^- approximate groups for which the statement fails in the limit for any given 
choice of implied constant in the 0^(1) notation. In particular, the cardinality |yln| will go to infinity 
as n — > oo. We assemble these approximate groups into an ultra approximate group A := Hn-s-a 
Our assumption implies that A is not "controlled" in a certain sense by what we call an ultra coset 
nilprogression, which we now define. 
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Definition 4.1 (Ultra coset nilprogression) . An ultra coset nilprogression is an ultraproduct P = 
rin-i-a '^^ coset nilprogressions P„ — P{ui,n, ■ ■ ■ , Ur.n', Ni^„, . . . , iVr,n) of fixed (standard) rank r and 
step s. We then say tliat P has rank r and step s. If the are also all in C-normal form for some 
(standard) C independent of n, we say that the ultra coset nilprogression is in normal form. We 
call Ni := limn-i-Q -^i.n for i ~ 1, . . . , r the lengths of the ultra coset nilprogression, and say that the 
nilprogression is nondegenerate if all the Ni are unbounded. 

We define the concept of an ultra nilprogression similarly, but replacing "coset nilprogression" by 
"nilprogression" throughout. 

As with all ultraproducts, it suffices to have the P„ obey the stated properties for all n sufficiently 
close to a, as one can redefine P^ arbitrarily on the remaining values of n without affecting the 
ultraproduct P. Note that an ultra nilprogression P can be expressed as 



where r is the rank, ui, . . . ,Ur are elements of the ambient nonstandard local group, and iVi, . . . , Nr 
are nonstandard positive reals. 

To obtain the contradiction, then, it is sufficient to establish the following ultraproduct version of 
our main theorem. 

Theorem 4.2. Suppose that A is an ultra approximate group. Then A^ contains a nondegenerate 
ultra coset nilprogression P in normal form with \P\ ^ |A|. 

Here |P| ^ |A| means that the non-standard numbers |A| and |P| satisfy |A| = 0(|P|), or in other 
words that there is a (standard) number C > such that \An\ ^ C\Pn\ for an a-large set of n £ N. See 
the end of Appendix [Al for more information. 

The Hrushovski Lie model theorem, Theorem I3.10[ will be a key tool in establishing this, as we 
discuss below. In addition to this theorem, a further fundamental additional concept in our argument 
will be the notion of an escape norm. 

Definition 4.3 (Escape norm). Let A be a multiplicative set. For a group element g E A^'~', we define 
the escape norm \\g\\e.A G [0, 1] to be the quantity 



Recall that by convention, the statement 5' G A is false if is not well-defined. Now suppose that 
A is a nonstandard multiplicative set, i.e. an ultraproduct A = ]^^_^^ An of standard multiplicative 
sets An- If 5 = lim„_j.Q (jn is an element of A^°, we define the escape norm Uglje.A G *[0,1] to be the 
quantity 



The escape norm can always be defined, but there are some remarkable lemmas essentially due to 
Gleason |20| concerning its properties when A is an approximate group. Specifically we will show in fj8] 
that there is a set A' controlling A for which the escape norms satisfy (precise versions of) the following 
estimates: 



P^P{ui,...,Ur;Ni,...,Nr) 




\\g\\e,A ■■= lim \\gn\\e,A„- 
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(i) (Product property) If gi,. ■ . ,gn e A' then \\gi . . .gn\\e,A' < ||5i|!e,A' H h ||gn||e,A'; 

(ii) (Conjugation property) E g,h e A' then \\h^^gh\\e.A' ^ HsUcA'; 

(iii) (Commutator property) If (/, /i £ A' then || [(/, /i]||e.yi' <C H^UcA' 11^*11 e,A'- 

These estimates, which we shall informally term "Gleason's lemmas" , will be proven in fJ51 They are 
valid in both the finitary and the ultralimit settings; the latter will be deduced, quite straightforwardly, 
from the former. 

The remarks in the following paragraph pertain to the finitary situation. To prove the Gleason 
lemmas, the set A' must be what we call a strong approximate group. The precise definition of this is 
Definition 1 7. II It is by no means obvious that there is a large strong approximate group A' contained 
in A'^, but this will follow from the Hrushovski Lie model theorem (Theorem 13. lOp . basically because 
small balls in a Lie group are automatically strong approximate groups, and can then be pulled back 
by the model map. 

One A' has been defined, Gleason's lemmas are proven by an argument closely analogous to that of 
Gleason himself [20]. We shall say nothing further about the details here; the argument is self-contained 
and is discussed in Jj8l 

With Gleason's lemmas in hand, let us describe the rest of the argument. 

Firstly, the set H = {g : \\g\\e,A' — 0} of elements which do not escape is a normal (genuine) 
subgroup of A'; this follows from (i) and (ii). We may quotient by H to get an ultra approximate 
group ^0 '■= A' /H, all of whose non-identity elements have nonzero escape norm. We shall call such 
approximate groups NSS approximate groups, in analogy with the no small subgroups property in the 
theory of locally compact groups. 

Now, if gi e Aq is an element other than the identity with smallest (nonzero) || • ||e,Ao"Gscape norm 
then we shall see that in fact, if A' is chosen appropriately, gi (z Aq. Item (iii) then implies that for any 
ft. G Aq, [gi,h] G Aq has smaller escape norm than gi, and hence must be the identity. In other words, 
gi is central in Aq and we may quotient again to get a new approximate group Ai := Aq / (gi) . We are 
being quite fuzzy at this point; in fact, the quotienting takes place in the category of local groups and 
one is quotienting not by the entire group (^i) but by an appropriate geometric progression within it. 

Continuing in this vein we pick g2 S Af other than the identity with smallest || • ||e,Ai-norm. We 
shall see that this norm is automatically nonzero, a consequence of the local nature of the quotienting 
operation. 

Continuing further, we pick 53, 34, .... 

All of this makes sense at the level of ultralimits as well, and in this setting one can show that 
Ai has a Hrushovski Lie model Li with dimi^ < dimLi_i for all i. Because of this, the quotienting 
procedure terminates in finite time with an element gk and one concludes by reversing these finitely 
many quotienting operations that A is controlled by an ultra coset nilprogression with "generators" 
H, gi, . . . , gk, thereby leading to a proof of Theorem 14.21 

This concludes our brief summary of the argument. Let us summarise the content of the remaining 
core sections of the paper. 

• In Section [5l we discuss results from multiplicative combinatorics, essentially due to Sanders 
and Croot-Sisask, which are relevant to the proof of Hrushovski's Lie model theorem. 
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• In Section |6l we prove the Hrushovski Lie model theorem. 

• In Section [7j we use the Hrushovski Lie model theorem to construct strong approximate 
groups. 

• In Section [HI we state and prove Gleason's lemmas. 

• In Section |9l we give details of the inductive strategy outlined above for constructing H and 
gi, . . . ,gk, and conclude the proof of Theorem 12. 101 (except for the rank bound). 

• In Section [TU] we show that the rank and step of the coset nilprogression can be bounded by 
6 log2 K in the global case. 

• Section [Tl] is devoted to various applications to the growth of groups and to Riemannian 
geometry. We prove there the corollaries stated in the introduction. 

5. Sanders-Croot-Sisask theory 

In the next section we will establish Hrushovski's Lie Model Theorem (Theorem I3.10p . in which 
an ultra approximate group is related first to a locally compact metrisable local group and then, via 
Goldbring's solution [23] of the local Hilbert's Fifth problem, to a local Lie group. In locally compact 
metrisable local groups we have total boundedness, which means that the unit (say) ball B(id, 1) 
{x e G : c?(a;,id) < 1} may be covered by 0^(1) smaller balls B{xi,e) := {x e G : d{x,Xi) < e}. On 
the other hand, by continuity of the group operation, B{id, 1) will contain high powers like i3(id,e)^'"^ 
for suitably small e. 

It is not surprising, then, that we need tools for showing (roughly speaking) that approximate groups 
A contain high powers of somewhat smaller, but still quite large, approximate subgroups A', which do 
not immediately escape A in the sense that {A'Y" is contained inside A (or perhaps a slightly larger set 
such as A^) for a reasonably large value of m. Such a tool is provided by a result from multiplicative 
combinatorics due to Sanders |3S] and to Croot-Sisask [T^l Theorem 1.6], namely Theorem 15.31 below. 
We shall also need a "normal" variant of this result, which essentially follows by combining Theorem 
15.31 with |48[ Lemma 13.1]. Our version of this is Theorem 15.61 below, and once again we provide a 
self-contained proof. 

Let us remark that by appealing to these results from multiplicative combinatorics we differ fairly 
substantially from the approach taken by Hrushovski [32 , although one may perceive structural simi- 
larities in the model-theoretic arguments he uses. 

All of the results below are essentially already in the literature, but always for subsets A of some 
ambient (global) group G. As it turns out, though, the proofs of these results end up being equally 
valid for the more local setting of multiplicative sets. Indeed, most of the tools used in multiplicative 
combinatorics (with the notable exception of the Fourier transform) are already "local" in nature in 
that they only require one to do 0(1) multiplications. 

Our first such tool is Ruzsa's covering lemma, which essentially allows one to select a "complete set 
of coset representatives" in the approximate group setting. 

Lemma 5.1 (Local Ruzsa covering lemma). Let A,B he finite sets, and suppose that AU B is a 
multiplicative set. Then there exi sts a finite set X C B with \X\ ^ and B C A'^X. Similarly 

there exists a finite set Y C B such that \Y\ ^ and B C VA^. 
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Proof. Let X be a subset of B such that the sets A ■ x ior x E X are disjoint, and such that X is 
maximal with respect to set inclusion; then we have \X\ ^ If 6 e B, then A ■ b and A ■ x 

must intersect for some a;, thus a ■ b — a' ■ x for some a, a' £ A. Multiplying on the left by a~^, we 



A corollary of this is the following result, which allows one to produce an approximate group from 
a set with small growth. 

Corollary 5.2. Let A be a symmetric multiplicative set, and suppose that \A^\ ^ -?^|^|- Then A^ is a 
2K -approximate group. 

Proof. Clearly A^ is a symmetric set containing the identity. Since \A^\ ^ K\A\ ^ ii'lA^I, we see from 
Lemma [5.11 that there exists X C A'^ with \X\ ^ K such that C A^X, and there similarly exists 
r C A" with \Y\ s^^K such that A^ C YA"^. Taking the union of X and Y we obtain the claim. □ 

We turn now to the result of Sanders ^ that drives our whole approach. 

Theorem 5.3 (Small neighbourhoods). Suppose that A is a K -approximate group, and let m ^ 1 be 

an integer. Then there is a OK.m{^)-0'PP'roximate group S with \S\ ^K,m \A\ such that S*™ C A^. 

Remark 5.4. Explicit bounds for the implied constants are given in, for example, |12[ Theorem 1.6]. 
As much of the remainder of the argument is not explicitly effective with respect to bounds, we do 
not worry about such quantitative issues here. Similar remarks can be made in connection with the 
normal variant. Theorem 15.61 below. 

Proof. We use the argument from [JB], generalised to the setting of multiplicative sets. For the con- 
venience of the reader, we reproduce it here. A somewhat different proof of Theorem 15.31 can also be 
obtained by using the techniques of [T^ . 

For each < t < 1, let f{t) denote the quantity 



Since \A^\ < K\A\, we have 1 < f{t) ^ K for all < i < 1. By the pigeonhole principle, we can thus 
find t ^K,m 1 such that 



Fix this t. As there are only finitely many sets B that make up the infimum for /, we can find a B G A 
with \B\ ^ t\A\ such that 



conclude that b — a ^ ■ a' ■ x, and the claim follows. 



□ 




(5.1) 




(5.2) 



\AB\ = f{t)\A\. 



For each a e A, the set Ba has cardinality \B\ and is contained in A^. 
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and hence by Cauchy-Schwarz we obtain 
The left-hand side can be rewritten as 

^ ^ \BanBa'l 

a^A a'GA 

and so by the pigeonhole principle, there exists oq G A such that 

aeA 1^ I 

Since |B| ^ and jA^j ^ K\A\, wc thus have 

^|i?anBao|;^-^|Ap, 

aeA 

and hence we can find a subset C of A of cardinality 
(5.3) \C\^t'/2K^\A\ 

such that jSa n Baa\ ^ t'^|yl|/2iir-^ for all a £ C. Multiplying by a^^ and by a~^, we see that 
\Bh n S| ^ t2|yl|/2iC2 for all h e 5*0, where So := U C^^ao U {id} is a symmetric subset in A^ 

containing the identity. From (|5.ip . we conclude that 

AiBknB)\^{l-^)m\A\. 

From (|5.2p . we conclude that 

\ABhr\AB\ ^ (l —] \AB\. 



lOOm, 

Using induction (and the hypothesis that A^ is well-defined, noting that B C A and Sq C A'^) we then 
see that for any 1 ^ m' < lOOm, the set S"™ is well-defined and 

m' \ 



\ABhnAB\^[l-^^]\AB\. 



for all h £ Sq^ , which in particular implies that S"™ C A"^. On the other hand, from (|5.3p we have 
|S'o| ^K.m 1^1- From CoroUarv 15.21 we see that S := Sq is a Oif^m(l)-approximate group. Since 
S™ — Sq™ C A'^, we obtain the first claim of the lemma. The second claim follows by applying the 
Ruzsa covering lemma (with B := Sq). □ 



Remark 5.5. Let us pause to note a consequence of this result. We defined multiplicative sets to be 
ones in which one was at liberty to take up to 100 multiplications (i.e. A^^" is well-defined), and 
the associative law would hold to this extent. Theorem 15.31 or more accurately a close examination 
of the proof of it, says that if A is an approximate group and a multiplicative set in which merely 
8 multiplications are allowed (i.e. A^ is well-defined) then A is Om.K(l)-controlled by an 0^,^(1)- 
approximate group A' = S in which up to m multiplications are defined an associative. For this reason 
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Theorem 12.101 holds if only 8 multiplications are allowed. We shall not dwell on such details further in 
this paper, allowing ourselves the luxury of 100 multiplications. 

We turn now to proving a "normal" variant of Theorem [531 Here, we use the notation 

b-^ab 

and 

:= {a'' : a e A,b G B} 
for elements a, b and subsets A,Boia. local group. 

Theorem 5.6 (Small normal neighbourhoods). Suppose that A is a K -approximate group, and let 
m ^ 1 be an integer. Let S ^ A* be a K' -approximate group with \S\ — S\A\. Then there is an 
Om.K.K' ,s{^)-0'Pproximate group S with \S\ :>K.K'.rn,s \A\ such that {S"')^ C 5-4. 

Theorem 15.61 will be deduced from Theorem 15.31 To motivate the argument, let us first recall a 
standard lemma from group theory. 

Lemma 5.7. Let A be a finite group, and let S be a subgroup of A with \S\ ^ \A\/K. Then there 
exists a further subgroup S d S of A with \S\ 3>a' |^| which is normal in A. 

Note that this lemma would easily yield Theorem 15.61 from Theorem l5.3l in the special case when A 
and S are genuine groups and not merely approximate groups. 

Proof. Let xi, . . . ,Xk be a complete set of right coset representatives for S in A, and set 

S — ^x^^Sxi — x^^Sx. 

All the claims of the lemma are immediate, except for the claim that \S\ \A\. However, this follows 
from iterating the fact that ii Hi, H2 ^ G are subgroups of small index in a group G then so is i/i n i/2 ; 
in fact we have the well-known inequality 

(5.4) [G:HinH2]^[G:Hi][G:H2]. 

□ 

To adapt this argument to the approximate setting we need an analogue of (|5.4p for approximate 
groups. This is provided by the following lemma. 

Lemma 5.8. Suppose that A is a K -approximate group and that Ai, A2 C A are sets with \Ai\ = 
Then AiA^^ D A2A'j^ contains a set BB~^ with B C A and \B\ ^ diS2\A\/K. 

Proof. Since A'^^ A2 C , we have |Aj~^A2| ^ if |^|. It follows that there is some x with at least 
(5i(52|^|/if representations as a{^a2. Let B be the set of all values of 02 that appear. Obviously 
BB~^ C ^2^2^^- Suppose that 02,02 G B. Then there are ai,a'j^ such that x = a^^a2 ~ (a'j^)^^a2, and 
so a^fl]^^ = a2'^2^^- Thus BB~^ lies in A\A\^ as well. □ 

By iterating the above lemma we obtain the following corollary. 
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Corollary 5.9. Suppose that A is a K -approximate group and that Ai, . . . , Ak C A are sets with 
\A,\ ^ 6\A\ for each i. Then \f]'^^^ A,Ar^\ :^s,k,K \A\. 

Now we can prove Theorem 15.61 

Proof of Theorem \5M By Theorem 15.31 there is an 0/,i<-,i<-/(l)-approximate subgroup 5*0 C S'^, 

|5'o| ':>m,K,K',5 \A\, 

such that 

(5.5) 5^5™+* C S^. 

The Ruzsa covering lemma allows us to do the analogue of picking a complete set of coset representatives 
in the approximate group setting. Specifically, there are xi, . . . ,Xk, k = Om,5,K{^), such that 

k 

(5.6) A^c\JS^x,. 

1=1 

Let us assume without loss of generality that xi — id. 
By Corollarv 15.91 the set 

k 

i=l 

has cardinality 

We claim that the set S := has the required properties. First of all note that, by Corollarv 15.21 S 
is indeed an Om,ic:,if', (5(l)-approximate group. 
Next observe that, since xi = id, 

(5.7) x-^Tx^ C Si 
for each i. 

Suppose that x G A'^. Then, by (|5.6p . there is some i with 1 ^ i ^ fc and some s E Sq such that 
X = sxi. It follows from this, (15.71) and (15. 5p that 

x-^S"'x = x-^T'^"'x = s-^x-^T^'^x^s = s-^ix-^Tx^f'^s C S^"'+^ C S^. 

This concludes the proof. □ 

6. Proof of the Hrushovski Lie model theorem 

In this section we establish Theorem l3.101 The reader may wish to reread ^ which gave an overview 
of this theorem. We will deduce this theorem from the following two propositions. 

Proposition 6.1 (Locally compact model). Let A he an ultra approximate group. Then admits a 
model TT : A'^^ G by a metrisable locally compact local group G. 

Proposition 6.2 (From locally compact models to Lie models). Let A he an ultra approximate group 
and suppose that A"* admits a model tt : A^^ G into a locally compact local group G. Then there 
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is a large ultra approximate group A of A (thus A* C A^) which admits a model tt : — >■ L into a 
connected, simply- connected Lie group L. 

It is clear that the above two propositions together imply Theorem 13. 101 

We will give a self-contained proof of Proposition 16. 11 using the multiplicative combinatorics results 
of the previous section, together with the countable saturation property of ultraproducts. In contrast, 
the proof of Proposition 16.21 requires deep material related to (the local version of) Hilbert's fifth 
problem, for which we provide suitable references. 

Building metrics on local groups. We now begin the proof of Proposition 16. II Suppose that 
we have a pseudometric d : G x G [0, oo) on some local group G, that is to say d satisfies the axioms 
of a metric, except that we may have d{x, y) — when x ^ y. Then we may of course define the 
balls _B(id,e) :— {x E G : d{x,id) < e}, and these will be nested in the sense that _B(id, e) C _B(id,£') 
if e < e'. We now examine ways to reverse this construction, beginning with a quite general way to 
construct pseudometrics on symmetric local groups; this will be needed to prove Proposition 16. II 

Let G be a symmetric local group. For any function ijj : G ^ R and g E G, we define the shift 
Tg^ iG^Rhy setting 

Tgip{x) := ipig^'^x) 

if g^^x is well-defined in G, and Tgilj{x) — otherwise. We then define the "derivative" operator 

9gV := V' - Tgip. 

The expression 

Wdgi'h^iG) ■■= sup \dg'lp{x)\ 

can be viewed heuristically as a "norm" of g relative to tp, and this makes it natural to consider the 
function 

(6.1) d{g, h) := \\Tg%l) - T/jVIU-CG) = l|9;,-ig-0ll^~(G)- 

One can view d as the puUback of the metric on £°°{G) to G using the translation action g i— > Tgtjj of 
G on tp. 

Lemma 6.3 (Using functions to build (pscudo-)metrics). Let G be a local group, and let A be a 
symmetric neighbourhood of the identity such that A^"^^ is well-defined in G. Let ip : G —?' M. be 
non-negative and supported on A. 

(i) We have ||9g?/'||£«=(G) ^ IIV'I|£°°(G) for all g E A^"^^ , with equality holding when g ^ A^. 

(ii) Whenever g,h E A^"^^ , one has 

(iii) For any g E A^"^^ , we have 

(6.3) ||9g-ll/^||£oo(G) = \\dgi}\\i^^G)- 
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(iv) The function d : A'^'^ x A^^ R+ defined by the formula ([63]) is a left-invariant pseudo-metric 
on A^^. 

Remark 6.4. To spell out what we mean in (iv), we are asserting that d{g, g) = 0, that d{g, h) — d{h, g), 
and that d{g, k) ^ d{g, h) + k) for all g,h,k G A^*. Furthermore it has the left-invariance property 
d{gh, gk) = d{h, k) whenever h,k G A^'^, g e A^'^^, and gh, gk e A®"*. Later on, when proving Gleason's 
lemmas, we shall require some slightly more exotic properties of these cocycle "norms" , related to 
commutation and a certain "Taylor expansion" . 

Proof. The property (i) is clear from construction. For g,h G A^'^^ we have the representation property 
TgThtp = Tghip and hence the cocycle identity 

dghip = dgi) + Tgdh^p 

which gives (|6.2p . 

Similarly, for g G A^'^^ we have the inverse identity 

which gives (|6.3p . 

The claims in (iv) follow easily from (ii) and (iii). □ 

In the next lemma we give a variant of the Birkhoff-Kakutani construction [39, §1.22], in which 
a function ip is constructed so that the pseudometric d{g,h) = ||9h-ig''/'l!f°°(G) is adapted to a given 
nested sequence of symmetric sets which are supposed to resemble "balls" in this pseudometric. 

Lemma 6.5 (Birkhoff-Kakutani construction). Suppose that G is a local group and that we have a 
sequence of symmetric neighbourhoods Aq, Ai, . . . of the identity in G with the nesting property that 
^i+i ^ for z = 0, 1, 2, . . . , and with Aq'^'^ well-defined. Then there is a pseudometric 

d-.Al'^x Al^ -> [0, 1] 

such that we have the inclusions 

(6.4) {g e : d{g, id) < 2-''^} C A^ C {g g A^* : d{g, id) ^ 2 ■ 2-^'} 

for all nonnegative integers k. In particular the pseudometric d if and only if, for each 

fc e N, we have x~^Xn G A^ for all sufficiently large n. 

Proof. Suppose that q = 2^*^ + • • • + 2^'*=, < q < 1, is a dyadic rational, and define 

Bq . Aj^^Ai^ -^ . . . Ai-^. 

Even though the definition uses a potentially large number k of multiplications, the nesting property 
of the Ai means that these sets Bq are well-defined in the local group G. 

We claim that Bq C Bqj_2-k whenever g is a dyadic rational with denominator dividing this 
easily implies that 

(6.5) Bq C Bq> whenever Q < q < q' <\. 
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The claim follows by repeated use of the nesting Af^^ C Ai (the number of times it will be required is 
the number of carries when 2"*"' is added to q in binary). 

In particular, Bq C Ai^^^i C ^o- 

Define i/. : AJ]* ^ [0, 1] by 

ip{x) sup{l ~ q:0 <q<l;x e Bq} U {0}, 

and consider the pseudometric d{g,h) \\dh-ig'il'\\i^{G) discussed in Lemma 16.31 Note that for 
g,h G Aq^, dfi-igip is supported in Aq^^, and so one can replace i°°{G) here with A^^'^ if desired. 

If d(5,id) < 2^*^ then |9g^(id)| < 2*^, which implies that Tpid) > 1 - 2^'^ and therefore g € B2-k 
and hence g G Ak- 

Conversely, suppose that g £ Ak- we are to show that d{g,id) ^ 2 • 2^*^. To show this we must 
confirm that \dg'ip{h)\ < 2^^^ for all h E G. As discussed before, we may assume that h S A})^^. 
Suppose that h E Bq, where < q < 1 — 2^*^ is an integer multiple of 2^*^, but that h ^ Bq_2->'- Then 
ipih) ^ 1 — q + 2^^ . On the other hand, g^^h £ AkBq C Bq^2-''^ by the claim established above, 
and therefore i){g^^h) ^ l-q- 2"'=. It follows that dgTp{h) = ipig^^h) - Tp{h) > -2 • 2"'=. Similarly, 
dg4>{h) ^ 2 • 2^*^. Since h was arbitrary it follows that d{g,\6) = ||9gV'||£°=(G) ^ 2 • 2^*^, and the claim 
follows. □ 

If the sets Ai satisfy a certain normality condition, the group operations are continuous with respect 
to the pseudometric d: 

Lemma 6.6 (Normal Birkhoff-Kakutani construction). Suppose that G is a local group and that we 
have a sequence of symmetric sets Aq, Ai, . . . in G with Aq'^^ well-defined and with the nesting property 
that {Af_^_i)^o C Ai for i = 0,1,... (and so, in particular, we certainly have the weaker nesting 
property Af^-^ C Ai required by the preceding lemma). Consider the pseudometric d : Aq^ x Aq"^ [0, 1] 
defined in the preceding lemma. Then the product map inversion map 

: — !■ j4q^ are both continuous with respect to d. 

Proof. Suppose that gn — g and that hn — )■ h. We wish to show that gnhn — >■ gh, to which end it 
suffices to establish that [gh]^^ gnhn G Ak for all sufficiently large n. However, for n sufficiently large 
in terms of k we have g^^gn G Ak+2, and hence 

-1 -1 h A^°° 

K 9 9nhn e A^.+2 C A^^2 ^ ^^+1- 

Furthermore, h^^hn G Ak+i for n sufficiently large, and so 

{ghy^gnhn = {h^^ hn){hn^ g^^ gnhn) e A\^^ e Ak, 

as required. The statement about the inverse map is easier. Suppose that g„ — >■ g. Then g^^gn G Ak+i 
for n sufficiently large, and so 

9ng = 9(9 9n)9 e c A^^l^ £ Ak. 

But this means that —> g as ri —> cx). □ 

The previous lemma showed how to get a local topological group given a sequence of balls satisfying 
a suitable normalisation condition. The normal variant of the Croot-Sisask-Sanders lemma. Theorem 
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15.61 allows us to find precisely such a sequence of balls given any if-approximate group A. Of course, 
these balls are just finite sets and, for sufficiently large i, Ai may well consist only of the identity 
element e. This will be the case, for example, when A — [~N,N]. However when transferred to 
the setting of an ultra approximate group A = rin-s-a^"' these balls have "finite index" in A, and 
this ultimately leads to the important conclusion that the metric d gives A the structure of a locally 
compact local group. 

Lemma 6.7. Let A be an ultra approximate group. Then there is a sequence of ultra approximate 
groups Aq, Ai, . . . such that Aq — , we have the nesting property that {Af_^^)-^<> C A^ for i — 
0, 1, ... , and each A^ is large in the sense that A can be covered by finitely many left-translates of Ai. 

Proof. By definition, one has A — Yin-fa some _?C-approximate groups An and some fixed K. 

Applying Theorem 15.61 repeatedly we see that there are, for each n, 0_R-,i(l)-approximate groups Sn.i, 
i = 1, 2, 3 ... , such that Sn,o ■— An and (S',^.i+i)'^"''' Q S^ ^ for each i. Furthermore we have Sn.i C A^ 
and |S'n,i| ^K,i \An\ for each i. Setting 

:= Y\. ^nA^ 
n— fa 

all of the properties except the assertion about covering are immediate. To check that each A^ is large, 
we need only check that Sn.o is covered by OK.i{^) left-translates of S^ i, for each i. This, however, is 
an immediate consequence of Lcmma l5.1l and the lower bound on |S'n,i|. □ 

Lemma 6.8. Let A be an ultra approximate group. Consider a sequence of ultra approximate groups 
Ao,Ai,... as found in the preceding lemma, and let d : A^^ x A"^^ [0; 1] be the pseudometric 
associated to these sets as in Lemma 1 6'. 51 Then A"^^ is locally compact with respect to the topology 
generated by d. 

Proof. By the Heine-Borel theorem (which is usually stated for metrics, but which extends without 
difficultjtj to pseudometrics) it suffices to show that A'^^ is complete and totally bounded. We deal 
with the latter task first. From the inclusion Aj, C {x : d{x, id) ^ 2 • 2~'^} and the left-invariance of d, 
this follows from the fact that A^^ is covered by finitely many left-translates of Afc. 

We turn now to completeness. Suppose that (a;„)„gN is a Cauchy sequence. By refining the sequence 
if necessary we may assume that it is rapidly Cauchy in the sense that d{xn, Xm) ^ 2^"^^. 

We claim that the sets a;„A„ are nested in the sense that XmAm Q XnAn whenever m > n. To 
see this note that by left-invariance we have d{id,x~^Xm) ^ 2^"^^ and hence, by the inclusions of 
Lemma 1^31 x~^Xm G A„+i. Since A„+iAm C Af^^^ C A„, it follows that x~^XmA„i C A„, thereby 
confirming the claim. 

Now each set XmAm is an ultraproduct Jln-i-Q ^ra,m by construction. The nesting property just 
established of course implies that, for any positive integer M , nm<A/ XmAm 7^ 0. Let j/m be an 
element of this intersection; this means that there is a set E^v/ G ct such that {yM)n € C\m<M ^m,n for 
all n e Sjv/. By replacing S2 with Si H S2 if necessary, and so on, and using the basic properties of 



Indeed, one can deduce the pseudometric case from the metric case by quotienting out by the equivalence relation 
X y defined by the equation d(x, y) = 0. 
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ultrafilters, we may assume that Si I) S2 ^ S3 ^ . . . . By removing 2 from S2, 3 from E3 and so on, 
if necessary, we may also assume that no integer hes in infinitely many 'Sm- 

Now define a sequence x by setting x„ = {yM)m where M is the largest integer for which n e Sm- 
Then, by construction, a;„ G C\rn<M ^rn.n for all n £ Sm, that is to say for a set of n tending to a. 
This means that x G Hm^M fo"^ every M, and hence x G f]xmA.m- In particular we have 

x^^x G Am for every m and hence d{x,Xm) ^ 2 • 2^™. It follows that Xm ^ x, thereby confirming 
that A is complete with the metric d. □ 

Remark 6.9. The last part of this argument, in which an element is found in the infinite intersection 
riin'^™-^™ given that each finite intersection Pl^^^^XmA^ is nonempty, is an instance of the count- 
able saturation property of the ultraproduct construction. The completeness that is afforded by the 
countable saturation property is one of the main reasons why we work in the ultraproduct setting. 
Note that a similar completeness also appears in the ultralimit {X, d)/ ^ oi bounded metric spaces 
{Xm dn), where X := Jln-s-a ^ stlimn-s-a dn, and ~ is the equivalence relation defined by setting 
X ^ y whenever d{x,y) = 0. Indeed, it is not difficult to use countable saturation to verify that such 
ultralimits are automatically complete, even if the original spaces Xn are not. 

Proof of Proposition \6.1i We have shown that A'^^ has the structure of a locally compact local group 
with respect to the metric d. To complete the proof of Proposition, we need only quotient by the 
equivalence relation ^ on A'^^, defined by a; ^ y if and only if d{x,y) = 0. The quotient L := A"^^/ ^ 
is then a metrisable, locally compact, local group and there is a natural map tt : A'^^ — > L. We must 
check that i is a good model for A** in the sense of Definition 13.51 

Property (i) requires us to show that there is some open neighbourhood Uq of the identity in L such 
that TT~^{Uo) C A^ and Uq C 7r(A'^), or in other words some ball {x G A^^ : d{x,id) < e} lies in A^. 
This again follows from (|6.4p and the fact that each of the sets Af. constructed in Lemma 16.71 lies in 
A^. 

Finally, property (ii) in the definition of good model requires us to show that vr(A^) is compact. 
This is immediate. 

To prove property (iii) , we first establish the following weaker property: 

(iii)' : for any open neighbourhood U of the identity in L there is some U' C U and some ultra 
finite set A' = n„^a K with n^\U') C A' C n-\U). 

This is quite easily established: suppose that U contains the ball -B(id, 2^*^). Then it follows imme- 
diately from the inclusions of Lemma 15751 that we may take A' :— A^^i and then U' := B{id, 2^'''^^). 

We now upgrade this to property (iii) in the definition of good model. Suppose that F C U C f/o 
with F compact and U open. Then there is some open neighbourhood of the identity [/' such that 
FU' C U. Applying (iii)', we may locate a further open set U" C U' and an ultra finite set A' such that 
n^^{U") C A' C TT^^{U'). By compactness there are elements xi, . . . ,xm such that F C lJm=i ^mU"; 
we may assume that these elements lie in F{U")^^ — FU" C [/ C Uq, and hence each is of the form 
Xi — 7r(ai) with Ui G A. To conclude the proof of property (iii) simply take A" := lJm=i ^.mA' . This 
completes the proof of Proposition 16. II □ 
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To complete the proof of Theorem l3.10[ we invoke results about Hilbert's fifth problem, and specif- 
ically the structural theorem of Goldbring [22] describing locally compact local groups, which we state 
as Theorem IB. 181 in the appendix. 

Proof of Proposition 1 6. 2\ Suppose that we have a model it : A^'^ G from A''^ to a locally compact 
local group G, and let C/q be the open neighbourhood of the identity featuring in the definition of good 
model (Definition 13. 5p . thus tt^^^Uq) C A'' and C/g C 7r(A^). By Theorem IB. 181 there are symmetric 
neighbourhoods U2 ^ Ui C Uq C G with [/|^ C t/i (say) and a compact normal subgroup H of U2 
such that Ui/H is isomorphic to a local Lie group L. Let (j) : Ui ^ Ui/H be the projection map. 

By property (iii) of Definition 13.51 (applied to tt : A — > G) there is a symmetric ultra finite set 
A C A** with 7r^^([/|) C A C ir^^iU^)- Certainly, the map tt := o tt is well-defined and gives a 
homomorphism from A® to L\ since 7r^^([/2)'* C 7i'~^(C^2^) C A'', we have A* C A^, and by Remark 
13. 7| A is an ultra approximate group. We verify that this is a good model by checking (i), (ii) and (iii) 
of Definition 13.51 in turn. For (i), first note that 7r(A) contains Uq := (j>{U2) — U2H/H C L, which is 
an open neighbourhood of the identity in L since U2H C G is open. 

Furthermore we have 

n-\Uo) = 7r-V"V(C/2) C n-\U2H) C ^-^{UD C A. 
Turning to (ii), 7r(A) is contained in the compact set (/)(C/|). 

Finally, we check the "approximation by internal sets" property, which is (iii) in Definition 13.51 
Suppose that F C U C Uo, with F compact and U open. Then (j)~^{F) — FH is compact, whilst 
<p^^{U) = UH is open. The approximation by internal sets property then follows from that fact that 
TT : A — > G is a good model. 

Finally, we check that A is a large ultra approximate group. To see this note that 7r(A^) is 
contained in a compact subset of L; therefore there are finitely many elements Xi,. . . ,Xk such that 
7r(A2) C yjLi TT{xk)Uo- It follows that 

k k 

A2 C y Xkn^^Uo C y XkA, 

i=l i=l 

thereby confirming that A is an ultra approximate group. By essentially the same argument, A may 
be covered by finitely many translates of A; thus A is indeed large. □ 

We now record some analogues of the above results in the setting of global ultra approximate groups 
(i.e. ultraproducts of global if -approximate groups for some fixed K) , which are closer to the results of 
Hrushovski [32] • Define a global model tt : (A) G to be the same notion as a good model tt : A^ G 
from Definition 13.51 except that A^ is replaced by the whole group (A) generated by A, and G is now 
required to be a global group rather than a local group. 

Proposition 6.10 (Global locally compact model). Let A be a global ultra approximate group. Then 
A^ admits a global model n : (A) G by a metrisable locally compact global group G. 
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Proof. This is obtained by a modification of tlie proof of Proposition 16.11 The one main change is 



that the nesting condition {Af,-^^)■^o°° c A; appearing in Lemma 16.71 needs to be strengthened to 



, 100(i + l) 



(Af_^j^)^i) C Ai, but this is easily accomplished. □ 

Proposition 6.11 (From locally compact models to Lie models). Let A be a global ultra approximate 
group and suppose that A** admits a global model tt : (A) — > G into a locally compact global group G. 
Then there is a large ultra approximate group A of A which admits a global model n : (A) — > L into a 
connected Lie group L. 

Proof. This is obtained by a modification of the proof of Proposition [^21 The one main change is that 
one needs to replace Theorem IB. 181 with Theorem IB. 171 □ 



Note that in contrast to Proposition |6]2] that we do not assert that the global Lie group L is simply 
connected (as this is not provided by the global Gleason-Yamabe theorem f Theorem IB. 17p . which only 
promises connectedness). And indeed, in general we do not have simple connectedness of the model. 
For instance, if A = {—N,...,N} C Z/IOONZ, for some unbounded nonstandard natural number 
N, then the obvious global model here is the map tt : Z/lOOiVZ M/Z defined by tt{x) = st( -^^^^ ) 
mod 1, and of course the unit circle R/Z is not simply connected. On the other hand, A^'^" = Z/IOOA^Z 
is globally modeled by the trivial group; and so one can still recover simple connectedness by passing 
from A to a suitably large power. See [32l Remark 4.11] for some further discussion of this point, as 
well as Theorem 110.101 below. 

Combining Proposition [6T0] and Proposition 16. Ill we obtain the following result, originally due to 
Hrushovski [32]). 

Proposition 6.12 (Weak global Lie model theorem). Suppose that A is a global ultra approximate 
group. Then there is a large ultra approximate group A of A which admits a global model tt : (A) L 
into a connected Lie group L. 

We will strengthen this proposition in Theorem 110.101 below. 

Remark 6.13. Let tt : A® — > L be a good model for an ultra approximate group A = Tlrn-Q 
a locally compact local group L, and let Uo be the neighbourhood in Definition 13.51 Let Ui be a 
symmetric neighbourhood of the identity such that Ul'^^ C Uq. For any continuous function f : L ^ R 
with compact support in Ui, we can define a functional /(/) by the formula 



/(/) = infst 



|A| 

where — linin^Q, is the ultralimit of functions : An K, with the nonstandard real 
X^asA ^^'^ nonstandard natural number |A| defined in the usual fashion as 



F+{a) hm V F+(a„) 



^ — ^ n— fa ^ — ^ 

aGA a„eA„ 



and 



lim I An I, 
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and the infimum is over all F'^ for which F^{a) ^ /(7r(a)) for all a e A. Using Definition IS-ST iii) it is 
not difficult to also obtain the equivalent formula 



I if) = sup St 



J2aeA 1 

where the suprcmum is over all for which F^ (a) ^ /(7r(a)) for all a E A. From these two definitions 
we see that /(/) is both super-linear and sub-linear, and is thus a continuous linear functional on the 
space Cc{Ui) of continuous compactly supported functions in C/i. By the Riesz representation theorem, 
there thus exists a Radon measure on Ui such that /(/) = Jjj^ f dfi for all / £ Cc{Ui). From the 
translation invariant properties of /(/), we see that iJi{gE) = /i(-E) for any measurable subset E of 
Ui, and any g (z L such that gE are defined in Ui, and similarly for gE replaced by Eg. Thus /i is a 
bi-invariant Haar measure on Ui; since A can be covered by finitely many left-translates of 7r^^(_F) for 
any compact neighbourhood F of the identity, we see that ^ is non-trivial (which implies in particular 
by bi-invariance that the locally compact local group L is unimodular). This Haar measure can then 
be used to estimate the (nonstandard) cardinality of various nonstandard finite sets that are "close" to 
A in some sense. Indeed, from the definitions (and the regular nature of Radon measures) we see that 

/i(F)|AK |A'K m(C/)|A| 

whenever F C U C Ui, F is compact, U is open, and A' is a nonstandard set with 

tt^HF) C a' C n-\U). 

We will not use this measure fi in this paper, but see |32j for some further discussion of this measure 
and its relationship to Kiesler measures from model theory. One can also use fj, to relate the volume 
growth of A™ to the volume growth of the model group L, giving some rigorous substance to some 
of the volume growth heuristics invoked in the examples in Section [3l but we will not formalise this 
relationship here. 

Remark 6.14. As remarked in [32], the Lie Model theorem is not only valid in the context of nonstandard 
finite ultra approximate groups, i.e. the ultraproduct of finite isT-approximate groups for a fixed K, 
but also for "continuous" ultra approximate groups, that is to say the ultraproduct of precompact open 
subsets of a locally compact local group that obey all of the approximate group axioms other than 
finiteness. See [5T] for the basic theory of such continuous approximate groups. Indeed, one can check 
that the machinery in Section [5] can be adapted to this setting by replacing the cardinality of finite 
sets with the Haar measure of various precompact open subsets of a locally compact local group, as in 
|51j . Some other components of this paper, such as the construction of strong approximate groups and 
Gleason metrics, can also be extended to this setting after some minor notational changes. However, 
there will be a key place in the argument^ in Section IH] in which the (nonstandard) finiteness of the 
ultra approximate groups is used in an absolutely crucial way, namely to locate an element in such a 
group element of minimal non-zero "escape norm" . As such, the main result of this paper. Theorem 



^^Another, much more minor, place where ultra finiteness is used in Remark l6.13l above. as we implicitly used the trivial 
fact that counting measure is bi-invariant. In general, one can only conclude that the measure associated to a good model 
is bi-invariant if each of the individual approximate groups in the ultraproduct is also equipped with a finite bi-invariant 
measure. 
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I2.10( does not immediately extend to the continuous setting. Indeed, the basic example of a small ball 
in a Lie group shows that continuous approximate groups need not resemble coset nilprogressions at 
all. We will not pursue this matter further here. 

Some finitary consequences of the Lie Model Theorem. To iUustrate the power of the Lie 
Model Theorem in the analysis of approximate groups, we offer two fairly quick applications. The 
reader interested in the proof of our main results may skip ahead to the next section. 

The first application is a special case of our main theorem (Theorem I2.10p , following Hrushovski 
[Sa Corollary 4.18]. 

Theorem 6.15 (Hrushovski). Suppose that G be a group of exponent m and suppose that A Q G is a 
K -approximate group. Then contains a genuine subgroup H of G with \H\ ^K,m \A\. In particular, 
by Lemma \5.1[ A is covered by 0^,^(1) left-translates of H . 

Remark 6.16. When m — 2 the group G must be abelian, and in this case the theorem is due to Imre 
Ruzsa l44i. 




Proof. Suppose for sake of contradiction that the claim failed. Then we may find fixed K, m and a 
sequence of if-approximate groups An C Gn in groups Gn of exponent m, such that for each n, A'^ 
does not contain a genuine subgroup Hn of cardinality \Hn\ ^ |^n|/n. As usual we form the ultra 
approximate group A := Hn-xi ^n- The ultraproduct group G := On-j-a ^^^^ exponent m, and 
by Hrushovski's Lie model theorem we can find a large approximate group A' C A'* with a local Lie 
model TT : (A')^ — ^ L. By Definition 13. Sf i). we may find a neighbourhood Uq of the identity in L such 
that iT^^{Uo) C A' and Uq C 7r(A'). Using the fact that the exponential map is a homeomorphism near 
the identity of L, we may then find a neighbourhood Ui oi the identity with C/™ C Uq such that Ui 
contains no elements of order m other than the identity. If a G 7r~^(J7i), then we conclude that a™ is 
well-defined in A' with 7r(a)'" — 7r(a™) = id, and so 7r(a) is trivial, which means that tt~^{Ui) — ker(7r). 
As 7r(A') is precompact, we conclude that A' is covered by a finite number of translates of ker(7r); as 
A' is large, A is also covered by M such translates for some (standard) finite M. 

From Definition 13.51 (iii), we see that the set 7r"^([/i) = ker(7r) is a nonstandard finite set, and so 
ker(7r) = Hn-j-a ^'^^ some finite subsets of Gn. Since ker(7r) C A^ is a group and A is covered 
by M translates of ker(7r), we see from Los's theorem (Theorem IA.6|) that for all n sufficiently close to 
ct, H„ C is a group and A„ is covered by M translates of Hn. However if one takes n larger than 
M then this contradicts the construction of An, and the claim follows. □ 

Remark 6.17. The astute reader will notice that the only properties of the local Lie group L that were 
really used in the above argument were that L was locally compact and had the NSS (no small subgroups 
property). Thus, one could prove Theorem 16.151 using a weaker form of the Gleason-Yamabe theorem 
(Theorem IB.17p . in which the model group is merely locally compact NSS rather than Lie. (The 
machinery of Hilbert's fifth problem implies that these two concepts coincide, but this is considerably 
deeper.) However, we do not know of a proof of Theorem 16.151 that avoids the machinery of Hilbert's 
fifth problem completely, and in particular some variant of the Gleason lemmas is required. 
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Next, we prove (a slight variant of) the main theorem from Hrushovski's paper [32[ Theorem 1.1], 
which uses the Lie structure (via the Baker-Campbeh-Hausdorff formula) more thoroughly than the 
preceding application. 

Theorem 6.18 (Hrushovski's structure theorem). Let A be a K -approximate group, and let F : N x 
N ~> N be a function. Then there exist natural numbers L, AI, N with N ^ F{L, M) and L, AI <^k,f 1; 
and nested sets 

{id} C An C ...C AiC A"^ 

with the following properties: 

(i) For each 1 ^ n ^ N , An is symmetric; 

(ii) For each 1 s^n < N, A^^^ C A„; 

(iii) For each 1 ^ n ^ N , An is contained in M left-translates of An+i; 

(iv) For 1 ^ n,m,k ^ N with k < n-\-m, the set [An, Am] '■— {[g,h] : 5 G A„, h G Am} is contained 
in Ak; 

(v) A can be covered by L left-translates of Ai. 

Proof. Suppose this is not the case. Carefully negating all the quantifiers, we conclude that there exist 
K, F and a sequence A^"^ of ii'-approximate groups, such that for each n and each L, M ^ n, there 
does not exist N ^ F{L, AI) and A^"-* , • ■ • , obeying the conclusions of the theorem. 

As usual, we form the ultraproduct A := On-fa ' which is an ultra approximate group. By 
Theorem 13.101 we may find a large ultra approximate subgroup A = Jln-^-a which has a good 
model (j) : ^ L hy a local Lie group. 

Let [ be the Lie algebra of L, and fix an open bounded convex symmetric body B in £. Let e > 
be a sufficiently small (standard) real number depending on B, L to be chosen later; in particular we 
may assume that the exponential map is a homeomorphism from eB to exp(£i?), and that exp(ei?) 
is contained in the neighbourhood Uq appearing in Definition 13.51 For each standard natural number 
n ^ 1, we apply Definition 13.51 and Remark 13.71 to find an ultra approximate group A„ with 

7r"^(exp(10~"eB)) C A„ C 7r"\exp(2 x 10""eB)); 

In particular we have the nesting 

. . . C A2 C Ai C A'*. 
From the Baker-Campbell-Hausdorff formula, we have 

exp(2 X 10-"-hBf C exp(10-"£B) 

if e is small enough, and thus A-n^^ C A„. In a similar spirit, we can find an AI depending only on the 
dimension of L or I such that each ball 10^"ei3 is covered by at most M translates of 4 x 10^"^^ei3, 
which by the Baker-Campbell-Hausdorff formula again implies, for small enough e, that each A„ is 
covered by at most M left-translates of A„+i. Finally, another application of the Baker-Campbell- 
Hausdorff formula reveals that 

[exp(2 X 10""eB),exp(2 x 10"™eB)] C cxp(10"''£B) 

whenever k < n -\- m, and hence [A„, A„i] C A^. 
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Finally, since one can cover 7r(A) by a finite number of translates of exp(£_B), we see that A can be 
covered by at most L left-translates of Ai for some standard L e N. 

Now set A„ = riiwco ^""^ some finite sets A^"' , and set N := F{L, M). Applying Los's theorem 
(Theorem IA.6[) repeatedly (but only finitely many times), we see that for n sufficiently close to a the 
sets a'""\ . . . , A^"*, yl'^"^ obey all the properties in the conclusion of the theorem. This contradicts the 
construction of the A^"^ for n larger than L, M, and the claim follows. □ 

Remark 6.19. One can also use the Lie Model Theorem to establish a stronger statement than Theorem 
16.181 which roughly speaking asserts that given a (finite) if-approximate group A, one can find a large 
sub- approximate group A' which has an approximate homomorphism tt : {A')^ — >■ L into a local Lie 
group L with bounded range that obeys an approximate version of Property (iHn Definition [3?5l where 
the accuracy of these approximations exceeds the "complexity" of the modeo by any given function 
F. The precise formulation of this statement, which is in fact a logically equivalent "finitisation" of 
Theorem 13. 101 is somewhat complicated. We will not need it elsewhere in the paper, and so we leave 
it as an exercise to the reader. 

7. Strong approximate groups 

We now give a combinatorial consequence of the Lie Model Theorem (Theorem l3.10p which will be 
important later, involving a concept which we will call a strong approximate group. 

Definition 7.1 (Strong Approximate Group). Let A be a if-approximate group for some K ^ \. We 
say that A is a strong K -approximate group if it admits a symmetric subset S such that 

(7.1) (^A^)lOOOK^ ^ ^ 

and for which the following two trapping conditions are satisfied: 

(i) (First trapping condition) If g, g^, g^, . . . , g^°°° g A^^^ then g G A; 

(ii) (Second trapping condition) If g,g^, . . . , ^ £ A then g £ S. 

An ultra strong approximate group is an ultraproduct A — J^„_j.o2 of strong /C-approximate 
groups An, for some K ^ 1 independent of n. 

At present this definition will seem somewhat unmotivated, although it can be demystified to 
some extent by remarking that these properties suggest that S and A are behaving like very small 
neighbourhoods of the identity in a Lie group L, with 5* much smaller than A. This point should 
become clearer shortly. The reader should not pay too much attention to exponents such as 1000i4'^ 
or lO^K^ in the definition; they are chosen for the sake of concreteness. 

The main reason for introducing this concept is that we will be able to show, in the next section, 
that the escape norm ||g||e,A (defined in Definition 14. 3p for an ultra strong approximate group A has 
the pleasant properties outlined in 2) There is scope for varying the parameters in the definition of 
strong approximate group, but the ones we have given here are strong enough to prove the desired 
properties of the escape norm. 

^^The complexity, which we do not define here, would be some quantity taking account of the dimension and structure 
constants on the Lie algebra I of L, the diameter of the range of tt and the inradius of the neighbourhood Uq appearing 
in Definition 13. ST iV 
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It is easy to give examples of strong approximate groups. For instance, if A — {—N, . . . , N} (and 
K = 3) then we may take S = {-N', . . . , N'} with N' - N/IOOOK^. If A is a subgroup, then we 
may simply take S = A. On the other hand, if one randomly removes a small number (e.g. N'^-'^^) 
of elements symmetrically from {—N, . . . , N}^ the resulting set is likely to remain a 0(l)-approximate 
subgroup, but not a strong 0(l)-approximate subgroup. 

The main result of this section implies the following. 

Proposition 7.2 (Finding a ultra strong approximate group). Let A be an ultra approximate group. 
Then there is a large ultra approximate subgroup A o/ A which is a strong ultra approximate group. 

For use in fJS]we will require the following somewhat more precise result. 

Proposition 7.3 (Balls are ultra strong approximate groups). Let A be an ultra approximate group 
with a good model n : A ^ L to a local Lie group L. Let B be an open bounded convex symmetric 
subset of the Lie algebra I of L. Then there exists a standard radius ro > such that for allQ < r < r^, 
any symmetric nonstandard finite set A with 

(7.2) 7r"^(exp(rB)) C A C 7r"i(exp(2rB)) 

is a large strong ultra approximate subgroup of A.. 

It is clear that Proposition 17. 21 follows from Proposition [7?3l Theorem l3.10[ and Definition 13.51 (iii): 
we will, however, only need Proposition 17. 31 in the sequel. 

We now prove Proposition 17.31 Let tq > be a sufficiently small quantity depending on A, tt, L, B 
to be chosen later; in particular, we take rg so small so that the exponential map is a homeomorphism 
from 2ro-B to exp(2roi?), and exp(2ro-B)"'^°" is contained inside the open neighbourhood [/q of Lq from 
Definition 13.51 

Let A be as in the proposition. In particular A^"" C A. By Remark 13.71 we may take A to be a 
ultra ii'-approximate group for somc0 K, and therefore an ultra approximate subgroup of A. Since 
7r(A) is precompact, it may be covered by finitely many left-translates of exp(r_B), and so A can be 
covered by finitely many left-translates of A. Thus A is a large ultra approximate subgroup of A. 

It remains to establish that A is a strong ultra approximate subgroup. Suppose that g G A^*'^ is 
such that g, . . . , (^^"O" g Applying tt, we see that 

n{g),...M9r''^exp{2rBr'. 

Working in exponential coordinates and using the Bakcr-Campbell-HausdorfF formula we conclude, if 
ro is small enough, that 7r(g) G cxp(r_B) and thus g G A. We have thus shown the first trapping 
condition for A. 

Next, we use Definition 13.51 to find a symmetric nonstandard finite set S with 

exp{10-^ R-^rB) CSC cxp{10-^K-^rB). 
From the Baker-Campbell-Hausdorff formula we see that 

(exp(10-*i^-VB)^'^P(2'-^)')i"0"^' C exp(rB) 

^^Indeed, by using the Baker-Campbell-HausdorfT formula one can take K to depend only on the dimension of L, if rg 
is small enough, but we will not need this fact here. 
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and thus 

(7.3) ((S)^')i°o°^' C A. 

Finally, suppose that 5 € A is such that g, . . . , g^^''^^ g A. Applying tt, we conclude that 

^(5),...,^(.9)i°'^^" eexp(2rB). 

Working in exponential coordinates, we conclude that TT{g) e exp{10^^ K^^rB) and hence 5 € S. Thus 
we have verified the second trapping condition for A. 

Finally, we need to push the trapping conditions from the ultraproduct A back to the finitary 
setting. Write A = Hn-^a ^it> A = On^a and S = for some finite sets A„,A„,S„. By 

Los's theorem ("Theorem IA.6[) . we see that for n sufficiently close to a, An is symmetric and contains 
the identity with A^ C A^, with A^ covered by K left-translates of An, that 

and that the first and second trapping properties hold for An and Sn- Thus we see that An is a strong 
/^-approximate group for n sufficiently close to a. After redefining An suitably for all other values of 
n, we conclude that A is an ultra strong approximate group as required. This concludes the proof of 
Proposition [73] and hence Proposition 17.21 

Remark 7.4. This proposition represents by far the most serious use of Hrushovski's Lie Model Theorem 
in our paper. Although we use that theorem elsewhere in the paper, it is only for this proposition that 
we do not currently have a plausible alternative approach. 



8. The escape norm and a Gleason type theorem 

In this section we prove a variant of "Gleason's lemmas" in the setting of approximate groups. 
These show that if A is a strong approximate group then the escape norm has pleasant properties with 
respect to product, conjugation and commutation. The role of these lemmas was briefly discussed in 

SI 

Here is a precise statement. 

Theorem 8.1 (Gleason-type theorem). Suppose that A is a strong K -approximate group. Consider 
the escape norm 

\\g\\e A inf | — — : n e g'' e A for all ^ i ^ n \ , 
[n + 1 J 

with the convention that \\g\\e.A ~ 1 when g is undefined. This has the following properties: 

(i) (Conjugation) If g,h £ A^^ then Wg'^We.A < 1000||.g||e,A; 

(ii) (Product) We have \\gi . . . gn\\e^A ^ K°^'^W\gi\\^^A + ■ ■ ■ + \\gn\\e,A) if gi, . . . , gn & A^° ; 

(iii) (Commutators) If g,h £ A^^ then we have \\ [g, h]\\e,A ^ llylle.All^lle.A- 
Note that, as a consequence of (i) and (ii), the set oi g £ A with ||5||e,A = is a subgroup normalised 

by A^°. 
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Remark 8.2. Note that this lemma is trivial when the ambient local group is abelian. For that reason, 
this section can be ignored by those readers interested in seeing our alternative proof of the (abelian) 
Freiman's theorem. 

Motivation and heuristic discussion. We will shortly give a self-contained proof of Theorem 18. 1[ 
but as motivation we first offer some comments and discussion of the context in which these ideas were 
first invented: the solution of Hilbert's fifth problem by Gleason, Montgomery- Zippin and Yamabe 
[T9l [201 [38l [39l [59l l60] (see also |22] for the local group analogue of these lemmas). 

In that context, the Gleason lemmas show the existence, in an arbitrary locally compact group G, 
of arbitrarily small compact neighborhoods A of the identity whose associated escape norm satisfies 
properties (i) to (iii) as above. The Gleason lemmas lie at the heart of Hilbert's fifth problem and are 
used at several places in its proof, both in the reduction step from general locally compact groups to 
NSS (No Small Subgroups) groups, and in order to deal with NSS groups. 

For example, if G is NSS, the Gleason lemmas are needed in order to establish that the set of one- 
parameter subgroups of G forms a vector space. If X{t) and Y{t) arc two one-parameter subgroups, 
then a natural candidate ioi X + Y is \\v[\n^+oo{X [t / n)Y [t / n))'^ . In order to show that such a limit 
does exist, the bound (ii) on the escape norm of a product is precisely what is needed. For the full 
story, the reader may wish to consult the classical references [331 133 , the more recent non-standard 
treatments of the Gleason lemmas by Hirschfeld [31] and by Goldbring and van den Dries [56] , or the 
blog posts of the third authoi0. 

To give a fiavour of how the Gleason lemmas are proven, let us discuss a simple case of the product 
estimate, namely 

(8.1) \\uv\\,,A^C{\\u\l,A+\\v\\e^A)- 

Here, A is a ball i3(id, 1) about the identity in a locally compact group G with the NSS property, where 
the ball is with respect to some left-invariant distance d, and C is some finite quantity depending on 
A. In the discussion below we will make use of the following points concerning this situation: 

(i) We may construct a distance d with the additional property that d(id,xf) ^ Cd{\d,x) for 
g,x E B{id, 2) (for example by the Birkhoff-Kakutani construction [39l §1.22]). 

(ii) The balls in G enjoy an escape property quite similar to that in the definition of a strong 
approximate group. More precisely, given e > there is an G N such that ii g,g^, . . . , g^^' G 
B{id, 1) then g E i?(id,e). The proof of this is by contradiction - taking a limit of putative 
"bad" gs, one can contradict the NSS property. 

The key idea behind the proof of the product estimate ()8.1|) is to relate the escape norm ||g||e,A 
to the auxiliary quantity ||9g5'||oo, where dg'i'{x) = 'i>{g^^x) — 'i>{x) and ^I^ is a non-negative "bump" 
function supported on i?(id, 1), let us say with ||5'||oo — ^'(id) — 1. As noted in Lemma [6.3[ such a 
"norm" automatically satisfies the product inequality (with C — 1), and so we need only show that 
||ff||e,A ^ ||5g^||oo in a suitable sense, and for a suitable VP. 



|http: //terrytao ■ wordpress . com/tag/hilberts-f if th-problem/| 
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In one direction, it is easy to link the two quantities. Indeed if ||9g5'||oo ^ S for some 5 > 0, then 
a simple telescoping sum argument confirms that VE'((7*) > 0, and hence G A whenever i < 1/6. 
Thererefore 



(8-2) \\g\\e,A^\\dgno.- 

Suppose, conversely, that we know that g,g^, . . . , g" E A ~ B{id, I). Then certainly, by the escape 
property, we have g, g^, . . . , G i?(id, e) for some n' 3>£ n. Now if G were a Lie group, and if 5* were 
smooth with bounded derivatives, we would have 



(8.3) dgn''^ ^n'dg'i', 

the approximation being better as e gets smaller. This immediately gives the bound ||9g^||oo ~e 
l/n, and thus we have linked the escape norm and the auxiliary norm ||9(,5'||oo in both directions. 

Now unfortunately (j8.3p is only an approximate identity and, more seriously, G is not known to be 
a Lie group. In fact, as noted above, these Gleason lemmas are required to prove statements of that 
form. On a more positive note, observe that we only need to bound ||9g^'||oo above in terms of ||g||e,yi 
when g = u or g — V, and not for all g. We are at liberty to design the auxiliary function \I/ with this 
in mind. 

Now the exact version of (|8.3p is basically Taylor's formula, and it reads 

n-l 

(8.4) dg.^ = ndg^ + J2 dgdg-'i'- 

i=0 

(We replace n' by n for ease of notation.) This makes it desirable to bound the second derivatives 
dgdgi'^ . At this point another key idea enters: it is possible to get good control on these second 
derivatives when = (j) * \s the convolution of two "Lipschitz" functions, that is 



'^{x) = (p * ■iIj{x) = J (f>{xz ^)^{z)dz, 
the integral being with respect to Haar measure on G. This is because of the formula 

(8.5) dgdhi(l)*^)= I dg^{z)dh^^{z-^x)dZ. 



To make this useful, (j) is chosen to be somewhat Lipschitz with respect to shifts hy g = u and g = v, 
and ip is chosen to be Lipschitz with respect to the distance d. We omit the details. 

Rigorous argument. We turn now to the details of such a strategy in the discrete setting, that is 
to say a rigorous proof of Theorem 18.11 



Proof of Theorem \8.1[ To simplify the notation, we will abbreviate \\\\e.A in this proof as |j||e. 

We start with (i), which is a relatively easy consequence of the first trapping property in the 
definition of strong approximate group (Definition [73J. Indeed suppose that g, g^, . . . ,g" G A for some 
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n; then certainly 5'', (g'')^, . . . , (g'')" e A'^ C A^'^. By the first trapping property this imphes that 
g'\ {g^)"^, . . . , (g'')" e A for any n' ^ n/1000, and this confirms (i). 

The proof of (ii) is significantly trickier and is based on the construction of Gleason that was briefiy 
discussed earlier. In order to facilitate a certain technical "bootstrap argument" , it will be convenient 
to temporarily replace the escape norm ||g||e by the regularised version := ||.g||e + e, where e > 

is a small quantity. We shall obtain estimates uniform in e, and then let e — > 0. 

It is natural to introduce the norm-like quantity 

:= inf |f] llff.lli^) : 5 = 51 ■ • - Sn," ;s l| . 

It is clear that 
(8.6) 

We shall prove an estimate in the opposite direction, namely 

(8.7) UU^K'^^'U^^Xg). 

The exponent 0(1) will be independent of e. This implies that, for each positive integer n and all 
ffi, • • • ,9n, 

|l5l...5n|le^i^°(^)(llgi|li^)+--- + ll.9„lli^^). 

Letting e — ?> 0, we recover the product estimate (ii). 

In order to establish this we shall, as in Gleason's argument, relate \\g\\e and d^^^{g) to an auxiliary 
quantity ||9g\E'||oo, where : — >• [0, 00) is a certain "smooth" function supported on A"^. We will 
specify ^> shortly; as in Gleason's argument it will be constructed as a convolution of two functions 
(j) and '0- The former is taken to be a kind of smoothed version of 1^ defined using the metric d^^^ 
and Lipschitz for this metric, and the latter constructed using the set S appearing in the definition of 
strong approximate group (Definition [73J and Lipschitz with respect to the word metric on S. 

One link between these quantities is relatively easy to establish for any function with ^(id) ^ 1. 
Indeed suppose that ||i9g^'||oo = S for some g G A^°^. Then certainly |^'(5*) — ^'(g*^^)| ^ <^ for all i 
with g^ € A-^'^'-', which implies by an easy telescoping sum argument that ^'(g*) ^ 1 ~ 5i for all i. In 
particular lies in the support of ^, and hence in A**, for i < 1/^; note that the hypothesis G 
can be removed by induction. By the first trapping condition in Definition 17. II this implies that g^ Cz A 
for i < 1/1000^, and hence ||.g||e ^ 1000(5. Thus 

(8.8) s; iooo||ag*||oo 

whenever g e A^"". 

To establish (|8.7p and hence the product estimate (ii) it therefore suffices to prove a bound 

(8.9) ||a,vI/||^«ifO(i)d(-)(g) 

in the opposite direction for all g e A^^"^ (the claim for g ^ being an easy consequence). This 
argument will depend crucially on the specific form of The following two lemmas describe the 
construction of the functions (j) and ip. 
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Lemma 8.3 (Properties of (f)). There is a function (j)'^^^ : A^'^^'^ [0, 1] such that 

(i) = lforxe A; 

(ii) <j)'^^'>{x) ^Qifx^A^; 

(iii) (Lipschitz bound) For all g £ one has 

Here S^^y^B) := ini {d'^^\b^^y) : b £ B}, and A"^ is the complement of A in G. 
Proof. Define 

^ V d(^)(id,A^)^- 

Note that this is well-defined since d(^)(id, A=) 7^ 0; this would be an issue without the fudge factor of 
£ that we have introduced. 

Obviously = 1 for a; e A. If 4)^^\x) ^ then d^'^ {id., x~^ A) = A) < d^'^) (id, A'^), and 

so x~^ A contains a point outside of A'^ . This implies that x G A^. 

The Lipschitz bound is easily established. □ 

Lemma 8.4 (Properties of tjj). There is a function ip : ^lo™ _^ jq^ such that 

(i) '4){x) = 1 for X G A; 

(ii) il){x) ^ Q if X A^; 

(iii) (Lipschitz bound) ||(9ft«Vl|oo =^ l/10''ii:^ for h £ S and y e . 

Proof Let Q := ; recall from the definition of strong approximate group that C A, where 
N := lO^K^. Define V'(g) = if g <^ Q'^A, ^{g) = 1 if ,g e A and il){g) = l-ilN if g e Q'+^A \ Q^A 
for i = 0, 1, . . . , — 1. The claimed properties of -0 are easily checked. □ 

We now define ^ to be the convolution 

*(^) '-^Ym^ ^^'\ymy-'x) 

for all X £ with the convention that ^'(a;) =0 for x outside A^'^^ . 

We note that 

*(id) = ^^Y.cp^^Hx)^{x-') 5^ ^ E '/'^^H^)V'(x-i) ^ 1^ 

a property required in the proof of (|8.8p . Note also that since </) and are both at most 1 pointwise 
and are supported on A^ we have, for all x such that '^{x) ^ 0, 



that is to say 



(8.10) ll*llco<i^' 
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Let g e j4^°". Now since id e A we have the crude bound (i'^'^'(id, A"^) ^ e. It follows from Lemma 
Uthat ||ag(^(^)||oo < S-^\g)/e. From the identity 

we have that 

1 R^"* 
|5,vl/(x)| WdMooTj: E ^(y"'^) ^ — ^^''(3)- 

This immediately yields the crude bound 

(8.11) WdMoo ^ —d^'Hg) 

in the direction of (|8.9p . the statement we are trying to prove. 
Denote by P{X) the bound 

(8.12) < X%) 

for all g E A^™. We have just demonstrated P{K^ /e), and we wish to prove F(i4:0(i)), which is (|8Jt . 
To this end we will implement a bootstrapping argument, showing that P{X) implies a stronger version 
of itself, namely P{X') with some X' < X, under appropriate conditions. 

The hypothesis P{X) (cf. (|8.12p ) implies an improved Lipschitz bound on (j>. To see this note that 
if d''^\g) < 1/lOOOX then from assumption P{X) we have ||5g*||oo < 1/1000 and hence, from (|KH1), 
that llglle < 1. By definition of the escape norm this implies that g € A. Phrased in the contrapositive, 
it follows that S'^\id, A'^) ^ 1/lOOOX, and therefore the Lipschitz bound in Lemma l873l implies that 

(8.13) Wdg^oo ^ lOOOXd'-'Hg). 

The bootstrapping argument hinges on the Taylor expansion identity 

n-l 

4=0 

valid whenever 5, . . . € A^^" (say). This identity implies, using the triangle inequality and (|8.10p . 

that 



(8.14) |19,vl;|l^ ^ -||a,.*|U + - E ll^.'^^.*lloo ^ + - E ll^.'^^^^lloo- 

1=0 i=0 

To use this, we need to focus attention on the first and second derivatives of To bound the first 
derivative we use the identity 

V 

valid for h e A^™. Since (f> ^ 1, this and the Lipschitz bound on ip given in Lemma 18.41 implv that 

(8.15) Wdhnoo ^TJ\Y1 \\9hy^oo ^ 
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We turn to the second derivative dh.dg'^ for g <^ A and h £ S. Here we use the identity 

y 

RecaUing that 0, are supported on and using the Lipschitz bound (|8.13p on </> together with the 
Lipschitz bound on -0 given in Lemma |8.4[ we obtain the bound 

(8.16) wdhdgnoo ^ E Ii590iiooii9/.«^iioo l^xd^^Xg) 

]i g e A and h e S. 

These bounds are useful in (|8.14p provided that n is such that g,g'^,...,g" £ S. However, the 
second trapping property in the definition of strong approximate group ensures that this is so for a 
reasonably large value of n, indeed for n as large as io6j<-3||g|| ■ Taking n this large and substituting 
into (|8.14p yields 

||9g*||oo 10^Jf^||.9|U + ^Xd^^Hg) ^ X'\\g\\i^\ 
where X' — lO'^K^ + j^X and g e S. The claim also trivially holds when g ^ S. 

It is easy to improve this to the stronger statement P{X') using the triangle inequality ||9g/iVE'||oo ^ 
ll^g^'lloo + ll^h^'lloo, already observed in (|6.2p . Indeed for every 77 > there are, by the definition of d, 
gi, . . .gn such that g — gi . . . gn and 

d'-'H9)>\\9i\\i'^ + --- + \\9nt^-V- 

Therefore 



Pg^lloo \\d,Moo + ■■■ + \\dg,Moo ^ X'iWgS'^ + ■■■ + |lg„|l(-)) ^ X'ir, + d^^\g)). 

Since 77 was arbitrary, we do indeed obtain the bound ||9g5'||oo ^ ^'^(5)1 which is the statement 
P{X'). 

By repeating this deduction of P(X') from P{X) many times, we see that the crude bound P{K^/e), 
established in (|8.1ip . eventually implies P{1Q^K^), and hence (|8.9p . By earlier remarks, this concludes 
the proof of (ii), the inequality for products. 

Finally, we turn to the commutator bound (iii). Now that we have the product inequality (ii), we 
may define a function cf) obeying the properties in Lemma 18.31 but using ||5||e instead of the fudged 
quantity — \\g\\e + £, that is to say with 

n 

d{g) inf{E llsdU; 3 = 5i ■ • • 5n, " ^ !}• 

This is because (ii) implies the lower bound d{id,A'^) ^ K^'-"-^\ and in particular d{id,A'^) =/= 0. 
Moreover we have the Lipschitz bound 
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(8.17) \\dg<j>\\oo ^ K'^^'Uig). 

We will use this function cf) in establishing (iii), the bound for commutators. Once again we consider 
an auxiliary function $, defined now to be the convolution 

again with the convention that $ vanishes outside of We observe the identity 

for g,h £ where Tg denotes the shift defined by Tgf{x) := f{g^^x) if g~^x is well-defined, and 
otherwise. It follows that 

ii%,h.]$iioo s; wdhdgm^ + wdgdhn^. 

By the first bound in (|8.16p (which holds equally well for this $) we have 

2 ^ 

||9[g,h]$||cO 2^ \\dg(l)\\oo\\dhy(l)\\oO- 

I I VGA* 

From (|8.17p we obtain 

||a[,,,]<I>|U < K^^'^dig) sup d{hy) < KO^'^\g\U sup ||/i^|U. 

yeA"- yeA"- 

By part (i) , this implies 

||%,/.]<i>||oo^if°(^1g||e||/l||e. 

To conclude, we note that (|8.8p holds for this new auxiliary function $ as well, since the only fact we 
used in establishing that other than trapping properties of A was the lower bound <i>(id) ^ 1. 

This, at last, concludes the proof of Theorem 18. II □ 

To conclude this section we assemble the main results of it and the previous section in a portable 
form. The following is the only result we shall need from SJ7] and the present section going forward to 
the next (and final) part of the paper. 

Proposition 8.5. Suppose that A is an ultra approximate group and that it : —¥ L is a good 
model for A into a connected Lie group L with Lie algebra [. Let B be an arbitrary compact convex 
neighbourhood of in [. Then, for sufficiently small r, r' with 2r > r' > r > 0, we may find a large 
strong ultra approximate subgroup A' of A such that 

(i) 7r-i(exp(rB)) C A' C 7r-i(exp(r'B)); 

(ii) (A')-'^" is well defined; 

(iii) The escape norm \\g\\e,A' satisfies 

(a) (Conjugation) Ifg,h€ (A')^° then \\h-'^gh\\e 0{\\g\\^); 

(b) (Product) If n is a nonstandard natural number and gi, ...,(;„ € (A')^*^ is a nonstandard 
finite sequence of elements o/ (A')^*^ {i.e. an ultraproduct of standard finite sequences, 
see Sectionl^ then \\gi . . . gn\\e = 0{J2i=i Il5i||e)/ 
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(c) (Commutators) If g,h G (A')-'-° then we have \\[g,h]\\e — 0(||g||e||^||e)- 
(iv) The set H :~ {g G A'; ||.g||e ~ 0} is a global internal subgroup, that is to say it is of the form 
H = rin->a^n' ^^ere Hn C An contains id and is stable under multiplication and inverse, 
which is contained in A! and is normalised by A'. 

Proof. The existence of A' satisfying (i) and (ii) follows from part (iii) of Definition 13.51 If r, r' are 
small enough then Proposition 17.31 ensures that A' is a ultra strong approximate group in the sense of 
Definition 17.11 Properties (iii) (a) , (b) and (c) then follow immediately from Theorem 18.11 and taking 
ultraproducts, and (iv) then follows from (iii). □ 

Remark 8.6. Observe that if A is a strong ultra approximate group, that is to say an ultraproduct of 
iiT-strong finite approximate groups, and if L is a locally compact model of A as given for example 
by Proposition 16. 1[ then from the strong approximate group hypothesis made on A we see that the 
standard part of the escape norm st(||(7||e,A) and the escape norm of 'K{g) G L with respect to the neigh- 
borhood of the identity 7r(A) of L are comparable. Namely ||'7r(5)||e^(A) ^ st(||5||e.A) ^ |K(.g)||e.7r(A)- 
As a consequence, if we take the standard parts of the escape norm in properties (i) to (iii), then what 
we obtain is precisely the analogous properties for the escape norm in L with respect to 7r(A). In that 
case, the three properties are essentially equivalent to the original Gleason lemmas in the literature on 
Hubert's fifth problem, applied to the locally compact (local) group L. In the sequel however, it will 
be very important that the three bounds (i) to (iii) obtained in Proposition 18.51 hold at the ultra level 
in *R and not only at the level of standard parts. 

9. Proof of the main theorem 

In this section, we complete the proof of our main theorem. Theorem 14.21 We will do so by first 
reducing to the case when A has no global internal subgroup. For convenience, we introduce the 
following definition. 

Definition 9.1 (No small subgroups). An ultra approximate group A has the NSS property if A does 
not contain any non-trivial global internal subgroup. 

By a global internal subgroup of A = A^, we mean a subset of the form Jln->a '^here 

Hn C An is a genuine subgroup. Note that A is NSS if and only if, for any g G A\id, the escape norm 
||(7||e,A is non-zero (though it may be infinitesimal). We remark that an analogous NSS condition for 
locally compact groups plays a key role in the theory of Hilbert's fifth problem. 

Example 22. Let iV G *N be an unbounded (nonstandard) integer. Then the interval A := [—N,N] 
(in the nonstandard integers *Z) is NSS. Note that while A contains global subgroups such as Z or 
{a; G *Z ; X = o(A^)}, such subgroups are not internal (they are not the ultralimits of standard sets). 

Clearly, any ultra approximate subgroup of an NSS ultra approximate group is also an NSS ultra 
approximate group. Using the Gleason lemmas from Section |S] we can reduce the proof of our main 
theorem to consideration of the NSS case. 

Proposition 9.2 (NSS reduction). Let A be an ultra approximate group. Then there exists a large ultra 
approximate subgroup A! of A., with (A')^"*^" well-defined and contained in A^, and a global internal 
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subgroup H contained in A' and normalised by (A')^*^", such that A'/iJ is an NSS ultra approximate 
subgroup, which admits a connected Lie group as a good model. 

We refer the reader to Definition 13.51 for the definition of a good model. Here A'/H denotes the 
quotient local group as defined in Lemma [6.121 

Proof. By Proposition 17.21 there is a ultra (strong) approximate group A' C A"' which is large relative 
to A, for which (A')^*^" is well-defined and contained in A**, and a good model n : (A')^ — >■ L, where L 
is a connected Lie group. Let B be an open bounded convex symmetric neighbourhood of the identity 
in the Lie algebra of L. Then for sufficiently small r > 0, exp(r_B) contains no non-trivial subgroups 
of L. 

Let H denote the global internal subgroup H = {g G A'; ||5||e = 0} given by Proposition 18.51 
Since H is normalised by A', it is also normalised by (A')^°''°. We may then apply Lemma [B.12I 
and consider the quotient local group (A')i°"/i?. Then {A!f/H = {A! /Hf is well-defined. Since 
A', H are nonstandard finite symmetric sets, A'/H is also; since (A')^ can be covered by finitely many 
left-translates of A, (A'/iJ)^ can be covered by finitely many left-translates of A'/H. We conclude 
that A'/H is an ultra approximate group. 

Since exp(ri?) contains no non-trivial subgroups, the image of H under tt has to be trivial, thus 
the homomorphism tt descends to a homomorphism of A'/H to L, which satisfies the conditions for a 
good model (see Definition 13. 5p . By construction, every element g € A' that is not in H has positive 
(but nonstandard) escape norm ||.g||e.A'- If g G A' and {[g]) C A'/H, where [g] is the class of g in 
A'/H, then (g) C A'^. On the other hand A' is a strong ultra approximate group, and thus ||.g||e.A' is 
non-zero if and only if ||.g||e.A'2 is non-zero. This implies that every non-identity element [g] in A'/H 
also has positive escape norm ||[g]||e.A7ff- Thus A'/H is NSS and the claim follows. □ 

Let us now state Theorem 14.21 in the special case of NSS groups, and show how the general case of 
Theorem [42] follows from it. 

Theorem 9.3 (NSS approximate groups contain large nilprogressions). Let A be an NSS ultra approx- 
imate group which admits a connected Lie group L as a good model. Then A"* contains a nondegenerate 
ultra nilprogression P in normal form, which is large relative to A. Furthermore, the rank and step of 
P are no greater than the dimension of L. 

Proof that Theorem \9.3\ imvlies Theorem \4-.2\ Let A be an ultra approximate group. We may find a 
large ultra approximate subgroup A' of A which satisfies the conclusions of Proposition 19.21 We may 
then apply Theorem l9.3l to A'/H and find in (A')^/i? a nondegenerate ultra nilprogression Pq in normal 
form with |Po| ^ We can write Pq = Psiui, . ■ . ,u^', Ni, . . . , Nr), where Ni g *N are unbounded 

and Ui e A'/H. We may then pick arbitrary lifts Ui e A' and set P = P{ui, . . . ,Ur; Ni, . . . , Nr). 
Then HP is a nondegenerate ultra coset progression in normal form contained in (A')^ C A**, and 
\HP\ ^ \H\\Po\ > |A| as desired. □ 

We turn now to the proof of Theorem 19. 3[ which will occupy the remainder of this section. We 
begin with a brief sketch, fieshing out a little more the overview given in ^ The proof will proceed 
by induction on the dimension of the connected Lie group L. The base case of the induction, when 
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dimL = 0, is trivial as in this case the NSS uhra approximate group A is also trivial. To treat 
the induction step we will consider an element u of A with smallest possible escape norm. The 
existence of such an element is guaranteed by our standing hypothesis that approximate groups are 
finite objects, i.e. that each ^„ in A = Jln-i-Q ^" finite. Then we will mod out A by the geometric 
P := {u", \n\ ^ l/||u||e,A}, where u is an element of A which the smallest possible escape norm ||u||e,A- 
The quotient local group A/P (in the sense of Lemma [B.12[) will be shown to be both NSS and to 
admit a Lie group with dimension at most dim L— 1 as a good model. It is at this step that we crucially 
rely on the fact that we arc only quotienting out by a local group, the progression P, rather than a 
global one such as the group (u) generated by u. We do this in order to avoid accidentally creating 
torsion with an excessively large quotient. Indeed, it is because of this component of the induction that 
it was necessary to cast the entire argument in the setting of local groups rather than global groups, 
even if one had been willing to restrict the main results of the paper to the global group case. Finally, 
making key use of the properties of the escape norm given by the Gleason lemmas, we will lift the 
nilprogression from A/P to A. 
Let us turn to the details. 

Proof of Theorem \9.3\ Let A be an NSS ultra approximate group which admits a connected Lie group 
L as a good model n : A^ L. We proceed by induction on dimL and first dispose of the trivial case 
when L has dimension zero. As L is connected, it must thus be trivial. Applying Definition 13 . 5f iii) . we 
conclude that A* is a large global internal subgroup of A. Since A is NSS, this kernel must therefore 
be trivial. Therefore A is trivial. 

Now suppose that dimL ^ 1, and that the claim has already been proven for connected Lie groups 
of smaller dimension. To complete the proof of Theorem l9.3l it suffices to establish the following lemma. 

Lemma 9.4 (Induction step). Suppose that A is an ultra approximate group admitting a connected 
Lie group L of positive dimension as a good model. Then A contains large ultra approximate subgroups 
A'" C A" C A' C A with the following properties. Let u € A' he such that ||u||e,A' is minimal and 
non zero, and set P := {u" ; |n| < l/||M||e,A'}- Then P commutes with (A")^^ and obeys the following 
properties: 

(i) the quotient A!" jP is an ultra approximate group which admits a connected Lie group of 
dimension dimi — 1 as a good model, whose Lie algebra is formed from the Lie algebra of L 
by quotienting out by a one- dimensional central subalgebra; 

(n) if A is NSS, so is A!" jP; 

(iii) to any large ultra nilprogression Q in A" / P in normal form, one can associate a large ultra 
nilprogression Q in A" in normal form, whose rank exceeds the rank of Q by at most one, and 
similarly for the step; and 

(iv) (A"')'^CA". 

Proof of Theorem \ 9.3i Indeed apply the induction hypothesis to A"'/P, which we can do by (i) 
and (ii). We may then conclude, using (iv), that A" /P contains a large ultra nilprogression. Finally, 
apply (iii) to conclude. 
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Proof of Lemma \9J\ Take B to be some small convex neighbourhood of in the Lie algebra ( of L. 
We shall take A', A", A'" to be such that 

(9.1) 7r-i(exp(S)) C A' C tt^i (exp(l.OOlB)) 
and 

(9.2) 7r-i(exp((5B)) C A" C 7r-i(exp(1.0015B)) 
and 

(9.3) ^-i(exp(AB)) c A'" C 7r-i(cxp(1.001 ^s)). 

where (5 > is a small (standard) real number to be specified later. 

It follows from Proposition 18.51 that large ultra approximate subgroups of A exist with these prop- 
erties, and furthermore that, if B is small enough, the escape norm || • ||e,A' satisfies the conjugation, 
product and commutator inequalities laid out in (iii) of that proposition. Note also that A', A" admit 
L as a good model and have the NSS property. 

Property (iv) of the lemma is essentially immediate; we turn to the more substantial (i), (ii) and 
(in). 

We begin with the proof of (i). Recall that w e A' is chosen so that ||u||e.A' is minimal and nonzero. 
Observing that ||x||e.A' =^ 100^ for x G (A")^°, it follows from the commutator estimate of Proposition 
18.51 (iii) (b) that for such x we have 

||[a;,u]||e,A' = 0(||a;||e,A'||u||e,A') < ||w||e,A' 

provided that 5 is chosen sufficiently small in terms of the implied constant O(-). 

Note that [x,v] lies in A', rather than merely (A')^, since its escape norm is less than 1. From 
the extremal property of u, it follows that [x, u] — id, that is to say x commutes with u, whenever 
X e (A")i". 

Recall that we are taking P :— {-«" : n < l/||M||e_A'}- 

Since (A")^ is well defined, we may apply Lemma [6.121 and form the quotient local group A"/P ~ 
rin-j-a^n/^n, which is clearly also an ultra approximate group. We now show that A" /P admits a 
proper quotient of L as a good model. To do this, we first verify that 7r(P) is a non-trivial central 
one-parameter local subgroup of L. 

Since dimL ^ 1, the groups A', A" are non trivial, and this implies that ||u||e.A' is infinitesimal, i.e. 
that Mq :— l/||u||e,A' is unbounded. Let n e *N be such that n — o{Mq). We must have u" G kerTr, 
because m*^" € A' for all fc e N. Define a map cj) : [—100, 100] — > L by setting 

(9.4) <i)(t):=TT{u^'<^), 

where [-J is the (nonstandard) greatest integer function. Then tt{u"') = (f>{st{n / Mq)) for all n € 
[— IOOA/q, IOOA/q], and is a local homomorphism in the sense that (j){t)(j){s) = (j){t + s) whenever 
t,s,t + s e [—100,100]. Also tt{P) = (/)([— 1,1]). Finally, we verify that (j) is continuous. Because of 
the local homomorphism property, it is enough to check this at 0. If t is small, then {(f){t))'^ = (f){tk) G 
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exp(1.001i3) for every integer k e [0, j], hence € exp(1.001i?/fc) is close to the identity in L, which 
gives the desired continuity. 

As (/> is a continuous homomorphisni from [—1, 1] to the Lie group L, there exists an element X of 
the Lie algebra [ such that ip{t) = exp{tX) for all t E [—1, 1]. Moreover X G I.OOIB. On the other hand 
by the definition of the escape norm we have u^^° ^ A', and hence 0(1) = exp(X) ^ exp(i?) and thus 
X ^ B. In particular X is non-zero, and it follows that (/>([— 1, 1]) is a non-trivial local one-parameter 
subgroup of L. Finally it is central in L, because L is connected and (f>{t) = Tr{u^*^o^) commutes with 
the neighbourhood of identity tt{A") as shown above. Thus X lies in the centre of the Lie algebra [. 

If we choose a neighbourhood U of the identity in L small enough, then by Lemma IB. 121 we may 
form the quotient space U/(j){[—l, 1]), which one easily verifies to be a local Lie group of dimension 
dimi — 1, whose Lie algebra is obtained from the Lie algebra of L by quotienting out by a one- 
dimensional central subalgebra. By Lie's third theorem every local Lie group is locally identifiable 
with an open neighbourhood of a global connected Lie group L', which in our case still has dimension 
dimi — 1. Thus, by shrinking U if necessary, we may find a local homomorphism rj : U ^ L' whose 
kernel lies in (/)([— 1, 1]). The local homomorphism i] o n : (A")* L' then pushes down to a local 
homomorphism ■0 : (A")^/P — > L' . Choosing S smaller if necessary, we may assume that -0 is defined 
on all of {A" /P)^, thus making L' a good model for A" /P. Note we may also ensure that tt{A") 
contains no non-trivial subgroup of i', a property that will be needed in Lemma 19.51 below. This 
completes the proof of (i). 

We turn now to (ii), which asserted that A'" / P is NSS. In fact we shall prove the same statement 
for A" /P, from which the statement for A'" /P follows (or note that an identical proof works). Key 
to this endeavour is the following lifting lemma, which we will require again in the proof of (iii). 

Lemma 9.5 (Lifting lemma). Let g e A" /P, and let k : (A")® — {A")^/P be the projection map. 
Then there exists g G A" such that K{g) — g and ||g||e.A" = 0(||.g||e^A"/p)- 

Let us first remark on why the NSS property of A" / P follows quickly from this. Indeed suppose 
that g E A" / P is not the identity. Then the element g generated by the above lemma is not the 
identity either, and hence has positive escape norm since A" is NSS. By the lemma, g also has positive 
escape norm. Since g ^ id was arbitrary, this establishes the NSS property for A" /P. 

Proof of Lemma \9.5i Fix g E A" /P. Let ^ be a lift of g in A" which minimizes the escape norm 
||ff||e,A" among all possible lifts of g. If 5 is trivial, then so is g and there is nothing to prove. Therefore 
we may assume that g is not the identity and hence, since A" is NSS, that it has positive escape 
norm. Suppose, by way of contradiction, that ||.g||e.A"/p = o(||5l|e.A")- Our goal will be to reach a 
contradiction by finding another lift of g with strictly smaller escape norm than g. 
Set Af(' l/|jg||«,A" e *N. 

We now make an important deduction from our hypothesis. For every n G *N such that n = 0(M"), 
we have G A" /P. In particular, for every (standard) integer A; G N, g^-'^'^i E A" /P. This implies 
that the group generated by g^^^ lies in A" /P. However, in projection to the Lie model. A" /P gets 
mapped into a neighbourhood of the identity in L' , which we chose small enough so as not to contain 
any non-trivial subgroup. We thus conclude that g^^^ maps to the identity in L' , and therefore g^^^ 
maps into the local one-parameter subgroup 0([— 1, 1]). 
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Now there is another element which maps to 1, 1]), namely u, the element for which ||u||e,A' is 
minimal. 

In order to motivate the rest of the argument, let us temporarily work in a heuristic setting (using 
informal notation such as ~), returning to tighten the argument rigorously later. Since 

(9.5) A" ^TT-\exp{SB)), 

and since M[' is the least n for which escapes A", we have 



(9.6) n{~g^'")^cl,{S)=exp{SX). 
Similarly 

(9.7) 7r(u<) w (f>{S). 

Now m" takes at least as long as g" to escape from A' w 7r~'^(expi3). Hence (roughly) it takes as 
least as long to escape from A" sa Tr~^{exp6B) as well, which means that 

(9.8) M[' < M[;. 

We are trying to find a lift of g with smaller escape norm than that oi g. To do this it is sensible 
to look for elements of the form h gu^^", m E *N. Provided that m is chosen judiciously, h will also 
be a lift of g since (by definition) u lies in P. Since, measured in 0([— 1, 1]) by applying tt, the element 
u is "shorter" than g, it seems reasonable that by an appropriate choice of m we can make h shorter 
than g as well. 

Being a little more precise, suppose that to, n E *N. Since u is central in A" we have ft," — g^^u~™"' 
whenever these expressions are well-defined, and hence 7r(/i") — 7r(g")7r(u~'""). From (|9.6|) and (j9.7p 
we have 



(9.9) 7r(g") w 0(st((Sn/Af{')) and 7r(u-™") w (j){st{-Smn/Mi;)). 

These expressions will be legitimate if to, n are chosen so that the arguments of the (^'s always lie in 
[-50,50] (say). It follows that 

(9.10) ^(;,") « 0(st(-L - _g,)5r^). 

However (by the Euclidean algorithm) there is a choice of to S *N such that |1/M" — to/A/q | ^ 
1/2M^'. Comparing with (|9T0l) we see that for n = 1, . . . , 2M^' we have 7r(/i") = (j){6') with S' S. 
Since TT^^{(j){[0,S])) C A", we must raise h to at least the power 2Mq before it escapes A". Since 
2Mq > A/q' ^ Af", this h is a lift of g with smaller escape norm than g. Note that the computations 
(|9.10p are legitimate for this choice of to and for n ^ 2A/q'. 

We now perform the above argument rigorously. Instead of the heuristic statement (|9.5p , we must 
work with the inclusions 

(9.11) 7r-^(exp(^B)) C A" C 7r-^(exp(1.0015B)). 
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To get a precise form of (|9.6|) . note that by definition of the escape norm we have g^^^ G A", 
whilst g'^" ^ A". In particular, as a consequence of (I9.1ip . 7r{g'^''"^^) e exp(1.001i5B), whilst 7r(^*^") ^ 
exp{5B). Since M" is unbounded, the first of these actually implies that 7r(g*^i ) e exp(1.001(5i?). 

Similarly 7r(u*^o e exp( 1.01(5-8), whilst 7t{u^^o ) ^ exp((S_B). Once again, the first of these implies 
that 7r(M*^o') e exp(1.001(5B). 

Since i? is convex, comparison of these facts shows that 77(5*^1 ) = ip{t) and 7r(u*^o ) = (pif) with 

(9.12) e [0.9(5,1.15]. 

Suppose that M[ and A/q are the escape times of g and u from A', respectively. Since u £ A' 
was assumed to have minimal escape norm, M[ ^ Mq. On the other hand (|9.1ip implies that 
M'^/M'^,M'{/M[ e [0.995, 1.01(5], and so 

(9.13) M'{ s; l.lAf^'. 

As in the heuristic discussion above, take h :— gu^™', for some m G *N. Let n E *N. Then we have 
7r(5") = (/i(st(in/M{')) and 7r(M-"") = (/)(-st(i'mn/Af^')) 

provided that the arguments of the (^'s are in [—50, 50], which will always be the case later on in the 
argument. Since u is central we have 

/ t mt' 

Roughly as before, we use the Euclidean algorithm to find m e *N such that |l/Af" — mt' /SMq\ ^ 
t'/26M[;. By (I9T2)) and (|9T3| it follows that 



mt' 



6M[' 5M[; 



t' t 1 0.9 



2(5 Af^' ' 5' M'{ M[' 
It follows from this and (|9.14p that 7r(/i") g (/)([0, 5]) for n ^ Af", and hence /i" lies in A" for these same 
values of n. As a consequence, h has smaller || • ||e,A" escape norm than g, contrary to assumption. □ 



Finally we prove item (iii) of Lemma 19.41 Suppose then that Q is a nondegenerate large ultra 
nilprogression in A" /P in normal form; we wish to lift this to a large ultra nilprogression Q in A" of at 
most one higher rank and step, while preserving the nondegeneracy and normal form properties. The 
main difficulty is that if one lifts the generators of Q arbitrarily then there is no guarantee that the 
progression they generate, or even a significant part of it, will be contained in A". The key to ensuring 
that we do achieve this lies in making judicious use of the lifting lemma (Lemma 19. 5|) and the product 
and commutator properties of the escape norm (Proposition |83] (iii) (b) and (c)). At this point we 
advise the reader to quickly review Definition 12.61 and Appendix [CJ where nilprogressions in C- normal 
form are discussed. 

We may write the non-degenerate ultra nilprogression in normal form as 

Q ^ Piui,...,Ur;Ni,...,Nr.), 

where the Ui are in A" /P, the Ni £ *N are unbounded, and r is the rank of Q, and some standard step 
s. From the normal form hypothesis (and taking ultraproducts), we have the following properties: 
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(i) (Upper-triangular form) For every 1 ^ i < j ^ r and £», Cj G { — 1, +1}, one has 

(9.15) K,^]ep(u,,,,...,u.M^),...Mj^))- 

(ii) (Local properness) The expressions u"^ . . . u"'' for nonstandard integers ni, . . . , with \ni\ ^ 

for all 1 ^ i ^ r are all well-defined and distinct, if C is a sufficiently large standard real. 

(iii) (Volume bound) One has 

Ni...Nr-^ \Q\<^Ni...Nr. 

(Note that as the Ni are unbounded, 2[AriJ + 1 and Ni are comparable.) 

Also, since m"' £ Q Q A" /P for all 1 ^ j ^ r and \ni\ ^ Ni, we have 

II II 1 

\\Ui\\e,A"/P < 

By Lemma 19.51 we may find lifts Ui € A" which project to ut in the quotient local group A"/P, 
and are such that 

|1Mj|U,A" = 0(||Wi||e,A"/p) 

and thus 

(9-16) \\u,\\e,K" < 

In order to include P in the lifted progression, we set Ur+i := u, the generator of P, and Nr+i 
l/||"||e,A"- From (|9.4p we see that 

(9.17) < 7V^+i < M'^. 
We then define 

Q P(iii,...,Wr,Ur+i;eA^i,...,eA'"r+i) 
for some sufficiently small standard £ > 0. We claim that Q is well-defined in A" as a nondegenerate 
ultra nilprogression in normal form, of rank (r -I- 1) and step at most s + 1. 

We begin with the claim that Q is well-defined in A". From (|9.16p and Proposition 18.51 one has 

||5lU,A" < e 

for all g £ Q, and in particular every product in Q lies in A" as required. 

It is clear that Q is a nondegenerate ultra noncommutative progression of rank (r + 1). To show 
that it is a nilprogression of step at most s + 1, it suffices to show that any iterated commutator g 
of length s + 2 in the generators uj^, . . . ,u^^i is trivial. Using commutator identities such as the 
Hall- Witt identity 

(9.18) [z,[x,y]] = [[y-\z-%xry[[z,x~%y~'ry 

where := y~^xy (using the unbounded nature of the Ni to justify all operations) we may restrict 
attention to iterated commutators g of the form g — [h,uf^] where h is an iterated commutator of 
length s + 1 and 1 ^ i ^ r + 1. But by projecting down to A" /P, we know that the image of h vanishes 
and thus h E P. Since P is central in A", the claim follows. 
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Finally, we need to show that Q is in normal form. We begin by establishing the upper triangular 
form (p?T|) . i.e. that 

whenever \ ^i<j^r + \ and e^, e {—1, +!}• 

If J = r + 1, then Uj = u commutes with every element of A", and in particular with w^, so the 
claim follows in this case. Now suppose that j ^ r. From (|9.15p we then have 

[W%^-]ep(^,...,Ti,;0(|^),...,0(^) 



which lifts to 



Thus we may write u^A = gu^, where 



and ?^ ^ l/|lw||e^A' — Mq. From (|9.16p and Proposition 18.51 one has 

|IK%<-^]|ie,A" « ^ 



and therefore 
and hence 

(9.19) \K\Ua" « ^ 



In particular, ||w"||e.A" is infinitesimal, which implies that t:{u") — id and hence n — o{Mq) by ()9.4 
Since ||u||e,A" = 1/iVr+i, we conclude that 

Nr+l 



l"l « N.N, 



and thus 

Noting that e > is standard and thus can be absorbed into the 0{) notation, this gives the desired 
upper triangular property. 

Next, we establish the local properness. Suppose that 

Til "r+l _ n'l ",.+ 1 

Ui . . . — ■■■Uj,^i 

for some |ni|, ^ eNi. Quotienting by P, we conclude that 

ni rtr n, n' 

Ul . . .Ur = Ul ^ . . .Ur ^ ■ 

This quotienting can be justified because all products here lie in Q and hence in A". By the local 
properness of Q, we conclude if e is small enough that rii = for all 1 ^ i ^ r; we may then cancel 
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and conclude that 

^ id. 

Since ||u||e,A" = l/-^r+i and In^+i — ^r+il < ^r+i, this implies that n^+i = J^r+ii giving the desired 
local properness. 

From local properness one immediately has the lower bound 

IQI >7Vi...A^,7V,+i. 
Now we establish the matching upper bound 

IQI <7Vi...A^,7V,+i. 
We first recall from the normal form of Q that 

IQI « iVi . . . Nr. 

From construction it is also clear that the image of Q under projection by P lies in Q. It therefore 
suffices to show that the preimage of any element in Q contains at most 0{Nr) elements of Q. By 
construction of the quotient map, we see that the preimage is contained in a translate of P, and thus 
has cardinality 0(Mq); the claim then follows from (|9.17p . This concludes the proof of Lemma WM and 
thus Theorem 1131 □ 

We can now conclude the proof of Theorem l2.10| the most basic form of our main theorem. 

Proof of the first part of Theorem \2.1(A We argue by contradiction. Negating the quantifiers, we see 
that there exists some K ^ I and an infinite sequence of local groups Gn and finite if-approximate 
groups ^ Gn, n G N, for which the conclusion of the theorem fails, namely for which does not 
contain any coset nilprogression of rank and step at most n in n-normal form and of cardinality at least 

Now form the ultraproduct A = rin-i-a inside G = TliwaGn- By Los's Theorem (Theorem 
IA.6p . G is a local group and A an ultra approximate subgroup. We can now apply Theorem 14. 2 [ whose 
proof we just completed, to conclude that A^ contains an ultra coset nilprogression P in normal form 
with \P\ ^ \A\. Using Los's theorem again we conclude that P = Jln-i-a where for an a-large set 
of n, Pn is a 1/ c-proper coset nilprogression contained of rank and step at most 1/c and of size at 
least c\An\ for some standard positive number c > 0. But this contradicts the construction of the A^, 
thereby yielding the claim. □ 

To conclude this section we record another useful conclusion from the above analysis: Hrushovski's 
Lie model is nilpotent. 

Proposition 9.6 (Nonstandard finite approximate groups have nilpotent Lie models). Suppose that 
A is an ultra approximate group and that tt : L is a good model for A into a connected Lie group 

L with Lie algebra [. Then 1 and L are nilpotent. 

Proof. By Proposition 18.51 we may find a large strong ultra approximate subgroup A' of A obeying 
the conclusion of that proposition. By quotienting out the elements H of A' of zero escape norm as 
in the proof of Proposition 19.21 we obtain an NSS ultra approximate group A' / H . Now one runs the 
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argument in Theorem l9.3l An inspection of this argument shows that if one unfolds the induction from 
Lemma [9.41 the Lie algebra [ of L is repeatedly quotiented out by central algebras until it becomes 
trivial. Thus, I can be obtained from the trivial Lie algebra by a finite tower of central extensions and 
is therefore nilpotent as required. The nilpotence of L is an immediate consequence of this and basic 
Lie theory. □ 

10. A DIMENSION BOUND 

In this section we prove Theorem 12.121 in which it is shown that the rank of the nilprogression P in 
the main theorem may be taken to be 0{K'~''^^^). We will also show that so long as we work in a global 
group G, and replace with it may be taken to be 0{\ogK). By the usual ultraproduct 

argument, it will suffice to establish the following nonstandard analysis formulation of the theorem. 

Theorem 10.1. Suppose that A is an ultra global K -approximate group, thus A = Hn-s-a /'''^ some 
finite K -approximate groups, each contained in a global group Gn- Then A^^ contains an ultra coset 
nilprogression P in normal form with \P\ ^ |y4| and rank at most 0{K^ \ogK). Moreover, there exists 
a standard natural number m such that A™ contains an ultra coset nilprogression P in normal form 
with \P\ ^ |A| and rank at most 6 log2 K. 

Recall that the step of a nilprogression in normal form is always less of equal to its rank. The 
derivation of Theorem 12 . 121 from Theorem 110.11 proceeds analogously to the derivation of Theorem l2.10l 
from Theorem 14.21 and is omitted. 

It remains to establish Theorem 110.11 The arguments here are inspired by some remarks of 
Hrushovski in [32, §4]. In particular, a key tool will be the following lemma from 32, Lemma 4.9]. 

Lemma 10.2 (Doubling in a simply connected nilpotent Lie group). Let G be a connected, simply 
connected nilpotent Lie group of dimension d, and let A be a measurable subset of G. Let ^ be a Haar 
measure on G {note that nilpotent groups are automatically unimodular, and so there is no distinction 
between left and right Haar measure). Then /^(A^) ^ 2''/i(A). 

Proof. We use an argument of Gelander from Lemma 4.9]. As is well known (e.g. see [H]), in a 
simply connected nilpotent Lie group, the exponential map exp : g — > G is a diffeomorphism, which 
pushes forward the Lebesgue measure fig on the d-dimensional vector space g, the Lie algebra of G, to 
the Haar measure /i on G. Thus it will suffice to show that /ig(log(A^)) ^ 2'^/ig(log A), where log is 
the inverse of exp. But as contains {a^ : a e A}, log(A^) contains the dilate 2 • log A of log A, and 
the claim follows. □ 

One is tempted to combine this theorem with the Hrushovski Lie Model Theorem directly (i.e. 
Theorem 13.10p . to get some dimensional control on the Lie group L. However, there is a technical 
obstruction; the Lie model is only available for an ultra approximate subgroup A' of A, and the 
covering parameter K' of this subgroup A' may be much worse than the covering parameter K of the 
original ultra approximate subgroup A. To get around this problem, we need to choose the subgroup 
A' more carefully. A clue as to how to proceed is provided by the following basic observation (cf. 
Lemma 7.3]). 
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Lemma 10.3 (Slicing approximate groups by genuine subgroups). Let A be a {possibly infinite) K- 
approximate group in a global group G, and let G' be a genuine subgroup of G. Then A' :— A^ f] G' is 
a -approximate subgroup and H G' can be covered by at most left translates of A' . 

Proof. Since {A')^ C n G", it suffices to show that A^ D G' can be covered by left-translates 
of A' . But A'^, can be covered by left-translates of A since A is a ii'-approxiniate group. Next, 
observe that if a left-translate gA of A intersects A"^ D G' in at least one point g' , then 

gA DA^nG'CgAnG'C g'A'. 

Thus A* n G" can be covered by left-translates of A' , as required. □ 

Lemma [10.31 suggests that we should look for Lie models of approximate groups A' that are formed 
by slicing with a genuine subgroup G' of G. 

We turn to the details. Let A^ be a sequence of ivT- approximate groups in global groups G„, and let 
A = rin->-a ^® their ultraproduct; thus A is a ultra ii'-approximate group that lies inside an ultra 
genuine group Iln-fQ ^n- Proposition 16.101 we may find a model vr : (A) — > G of A'' by a locally 
compact group G. 

Let Uq be the neig hbourhood in Definition [331 We have Tr'^{Uo) Q A'^ and Uq C 7r(A'^). By 
Theorem IB. 17[ there is an open subgroup G' of G and a closed subgroup H oi G contained in Uq and 
normalized by G' such that L := G' /H is a connected Lie group. Let Ui C G' be an open subset such 
that H C Ui C Ui C Uq and let : G' — i denote the quotient map. 

Now set A' := A^n7r~-'^(G'). From Lemma fl . 31 applied to A^, we see that A' is a iC^-approximate 
group. We now also claim that A' is a nonstandard finite set, which would make A' an ultra K^- 
approximate group. To see this, observe first from Definition I3.5r ii) that 7r(A^) is contained in some 
compact set F. As G' is an open subgroup of G, it is also closed and FDG' is compact. We then see from 
Definition 13 . 5( iii) that we can find a nonstandard finite set A* such that TT~^{Fr\G') C A* C 7r~^(G'). 
Thus A' = A^n A*, and so A is a nonstandard finite set as required. 

Note that 7r(A') contains the open set Uq D G' and is itself contained in a compact subset of G'. 
Hence the set E := (/)07r(A') is precompact and contains a neighbourhood of the identity in L. Moreover 
{4> o TT)~^{(j){Ui)) C A', hence it follows that o vr : (A') ^ L is a good model for A'. From Lemma 
19. 6[ we conclude that L is nilpotent. Every connected nilpotent Lie group admits a unique maximal 
compact subgroup which, moreover, is central. Let N be the maximal compact subgroup of L and 
9 : L ^ L/N be the quotient map. 

We claim that dim(L/iV) ^ 61og2-ftr. To see this note that, as A' is a iiT^-approximate group, we 
see that E^ is covered by at most left-translates of E. Therefore 9{EY can be covered by at most 

2 

left-translates of 0{E), and hence 9{E) can be covered by at most translates of 9{E), where 
9{E) is the topological closure of 9{E), a compact set with non-empty interior. If we let /i be a Haar 
measure on L/N , it follows that 

On the other hand, from Lemma [10.21 one has 

fi(9(E)^) ^ 2'^™(^/^V(^M)- 
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Since 0(E) has non-empty interior, fi{9{E)) ^ and so comparison of these two inequalities imphes 
that 

(10.1) dim(L/iV) eiogsX. 

We now explain how to derive the second part of Theorem 110.11 from the above; we will turn to 
the first part later. We consider <j>~^{N), which is the kernel of the projection map from G' to L/N 
(= {G' / H) / N) . This is a compact subgroup of G". Since 7r(A') contains an open neighbourhood of the 
identity, we conclude that there exists a standard natural number m such that 4>''^{N) C 7r(A""^^), 
and this implies that A'™ contains the kernel of o o tt, which implies that ocfjon : A'*™ L/N is 
a good model. By Proposition 19 . 2 1 we conclude that A'^™ contains a large approximate subgroup A" 
with (A")^"°° well-defined and contained in A'^™, and a global internal subgroup H' of A" such that 
A" / H' is an NSS approximate subgroup with a connected Lie group as a good model. An inspection of 
the proof of that proposition reveals that we may take (A")^°°° inside A"" and that we may take the 
connected Lie group to be L/N , which we have shown to have dimension at most 61og2 K. Applying 
Theorem 19.31 we see that {A" / H'Y contains a large ultra nilprogression in normal form of rank at 
most (S\og2K, and thus (A")"* contains a large ultra coset nilprogression in normal form of rank at 
most 61og2-ftr. As (A")^ is contained in A"", which is in turn contained in A'*™, the second part of 
Theorem 110.11 follows (after redefining m). 

We now turn to the first part of Theorem llO.il As we see from the last paragraph, the difficulty here 
is that (t>^^{N) may not be contained in 7r(A'). We will show that nevertheless 7r(A') still contains a 
subgroup 0^^(A^o), where A^o is a closed subgroup of N with small codimension. For this the key is the 
following lemma, which is potentially of interest its own right. Here, and below, we write T'' := W^/U^ 
for the d-dimensional torus. By a subtorus we mean a closed connected subgroup of T''. 

Lemma 10.4. Let K,d ^ 1 and A be a closed K -approximate group in T"^ containing a neighbourhood 
o/O. That is, A is closed, contains a neighbourhood o/O, is centrally symmetric, and there is a finite 
set X C T'', \X\ ^ K, such that A + A C A + X. Then AA := A + A + A + A contains a subtorus 
T C T"^ of codimension at most 0{K^ logK). 

Before proving Lemma 110.41 we explain how to conclude the proof of Theorem 110.11 with Lemma 
110.41 in hand. First we observe that setting A'^ := A^ n tt~^{G'), tt{A[) is a neighbourhood of id 
in G. Indeed A** is and A"* C XA for some finite X, so that 7r(A) has non-empty interior, and 
hence 7r(A^) is a neighborhood of id. We now apply the lemma to A — cj) o tt{A'i), and conclude that 
<P~^{T) C 7r(A'^'*) C 7r(A'^). Writing 9' : L ^ L/T for the projection map, we see that A''^ contains 
the kernel of ^'o^ott and this implies that A'^ admits the connected nilpotent Lie group L/T as & good 
model. Moreover dim L/T = dimi/T'^ + dimT'^/T = 0{K'^ log K) by pin|) and by Lemma HUH The 
rest of the proof is then identical to the previous case: by Proposition l9.2l and its proof we conclude that 
A''^ contains a large approximate subgroup A" with (A")^'^°*' well-defined and contained in A''^, and a 
global internal subgroup H' of A" such that A" /H' is an NSS ultra approximate subgroup admitting 
L/A'o as a good model. By Theorem l9.31 we see that (A"/_ff')* contains a large ultra nilprogression in 
normal form of rank at most 0{K^\ogK), and thus (A")"* contains a large ultra coset nilprogression 
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in normal form with rank at most 0{K'^ log if). As (A")* C A'-^ C A^^, the first part of Theorem llO.il 
follows. 

We now turn to the proof of Lemma 110.41 Let /i be the normalized Haar measure on T''. Note 
that the group T'' of characters of T*^ identifies with Z'^. Our main tool is the notion of the (a-)large 
spectrum of an additive set A, defined by 

SpecJA) {e e Z'^ : lUiOl^M^)}- 

See [53l Definition 4.34] for this definition and a further discussion. 
If S" C Z'' is a set of characters, we write 

:= Pi ker^. 

Note that S-^ is a closed subgroup of T'' with codimension the rank of the subgroup of Z'' generated 

To prove Lemma [10.41 we first reduce to the case in which ij,{A) is somewhat large by establishing 
the following lemma. 

Lemma 10.5. Suppose that A <Z T"^ is a closed K -approximate group containing a neighbourhood of 
0. Then there is a subtorus Tq C T'* with dim(To) ^ d — 0{logK) and some xq G T'* such that, writing 
Ho for the Haar measure on Tq, we have fJ-o{{A + xq) Tq) 3> e"*-'^^'°s^). 

Then we handle the case in which fi{A) is somewhat large by proving the following, which is a 
straightforward continuous analogue of the so-called Bogolyubov-Chang lemma [9]. 

Lemma 10.6. Suppose that A C T'^ is measurable, that ^ a, and that fJ.{2A) ^ Kfj,{A). Then 

2 A — 2A contains a subtorus T C_ T'^ of codimension at most 0{K\og{l/a)). 

To deduce Lemma 110.41 from Lemmas 110.51 and 110.61 we proceed as follows. Locate an xq as in 
Lemma llO.51 and suppose furthermore that for this a;o the measure of (A + a:o)nTo is close to maximal 
in the sense that 

(10.2) fin{{A + xo) n To) ^ ^Mi^ + ^) ^ Tq) 

for aU X e T''. Set Ai := {A + xq) n Tq. Then , since A + A C A + X , we have 

Ai+Ai'Z{A + X + 2xo) n To. 

By (|10.2p it follows that iJ.{2Ai) ^ 2Kfi{Ai). We are now in a position to apply Lemma flO.GI to Ai, with 
a = e~'-'^^^°^^\ We conclude that there is a further subtorus T C Tq of codimension 0{K^ log K) 
inside 2Ai — 2Ai. Since 2Ai — 2Ai C 4A, this concludes the proof of Lemma [10.41 

For the proofs of both Lemmas 110.51 and 110.61 we will require the following lemma of Bogolyubov 
type. 

Lemma 10.7 (Bogolyubov-type lemma). Let ACT'' have positive measure and let k ^ 2 be a natural 
number. Suppose that 

S {n{A)/2n{kA)y^^^'''^\ 
Then kA — kA contains {Specg{A))-^ . 



64 



EMMANUEL BREUILLARD, BEN GREEN, AND TERENCE TAO 



Proof. It suffices (in fact, it is equivalent) to sliow tliat if x e (Spec^(A))-'- then f{x) > 0, wliere 
/ — * — * ■ ■ ■ * * 1-A * • ■ • 1-A is tlie convolution of k copies of 1a and k copies of 1-a- 
Now by the Fourier inversion formula we have 

(10.3) ^ E E 

where we have used the fact that £^{x) = 1 if ^ € Spec5(A) and x G (Spec^(A))-'-. Now Parseval's 
identity and the Cauchy-Schwarz inequality imply that 

On the other hand, by a second application of Parseval's identity, we have 



«^SpGc,(A) 



Substituting these inequalities into (|10.3p yields 

The lemma follows immediately. □ 

Lemma 110.61 is an immediate consequence of the case fc 2 of this lemma and (the continuous 
variant of) "Chang's lemma" [9 , which is the following statement. For a proof, see [53l Lemma 4.36]. 

Lemma 10.8 (Chang's lemma). Suppose that a < 1/2 and that A C_ is a measurable set with 
/x(j4) ^ a. Then Spec^(A) generates a subgroup of 7/^ of rank at most 0{5^^\og{l/a)). 

Proof of Lemma \l0.6[ Noting that iJ,{A)/^{2A) ^ 1/K, the lemma follows from Lemma 110.71 with 
k — 2 and S := 1/2\/K followed by an application of Lemma [10.81 □ 

To prove Lemma 110.51 we will apply Lemma 110.71 with a much larger value of fc, as well as the 
following result. 

Lemma 10.9. There is an absolute constant c > with the following property. Suppose that ^ C T'' 
is a closed K -approximate group containing a neighbourhood o/O. Then SpeC]^_^/ (^) generates a 
subgroup of T,'^ of rank O {log K). 

Proof. Let e — c/\ogK, where c > is to be chosen later. Suppose that ^ G SpeCj^_^(A) and that 
i^Q. Let ?7 : T'^ R/Z be such that £,{x) = e^^'^'^^^l Then 



1a(x)(,{x) dn{x) 
is real, since A is symmetric, and at least (1 — e)/i(A). Thus 

1a{x) 008(2711] (x)) d^{x) > {1 — e)^{A), 
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and hence the (symmetric) subset A'{£^) C A, consisting of those x for which cos(27r?7(x)) ^ 99/100, 
has measure fi{A'{£_)) > (1 — 100e)/u(A). In particular ||?7(a;)|| < whenever x G A'{£^), where 
ll^ll :^mi,ez\0-z\. 

Suppose now that SpeC]^_g(A) contains elements fi, . . . which are linearly independent over Q, 
and let 771, ... , 77,„ be the corresponding R/Z-valued characters. Consider the set A" := CiiLi A'{£,i)] 
provided that m < 1/lOOe, this wih have n{A") > ^fJ-iA). Note that A" = - A". 

Consider now the homomorphism ip -.T^ ^ T™ given by 2: i-> (771 (x), ... , ri,n{x)). The image of A" 
under ijj lies in a box of diameter 1/5. 

Now any subset U of T™ = (R/Z)™ which lies in a box of diameter < 5 is Freiman 2-isomorphic to 
an open subset of M™, and thus by the abelian case of Gelander's Lemma [10.21 (which, in this case, is 
just a very simple case of the Brunn-Minkowski inequality), we have 

{2U) ^ 2'"a*™(C/), 

where /i,„ is the normalized Haar measure on T™ = (M/Z)™. However, we have /i(5A") ^ IJ-{5A) ^ 
K^pi{A) ^ 2K'^fi{A"), and an application of the Ruzsa covering lemma (here our Lemma [5.2p shows 
that 2A" is a 4i^"'- approximate group, and consequently so is U := iIj{2A"). 
Therefore, noting that iJ.,n{U) 7^ since fJ,{A") ^ ^l^{A) > 0, we obtain 

2" < {AKy = {2Kf, 

a contradiction if m > 81og2 2K. Such a choice of m is acceptable if e < c/\ogK with c sufficiently 
small, and so we are forced to conclude that ^1, . . . , cannot exist. The lemma follows. □ 

Proof of Lemma \10.5[ Note that kA C (k - l)X + A, and that \mX\ ^ ml-^l = for all natural 
numbers m. It follows that ^{kA) ^ k^^{A), and so Lemma [10.71 is applicable with ^ = 1 — c/\ogK 
for some k ^ K'^' . The conclusion is that 2kA contains the subtorus Tq := (Spec2_c/iogA-(^))^ which, 
by Lemma [10.91 has codimension 0{\ogK). However 2kA is covered by at most (2k)^ = e'~"^^^°^^^ 
translates of ^, and so one of these translates has ^q{A + xq) ^ ^-o{K\ogK) ^ which was precisely what 
we claimed. □ 

To conclude this section we record the observation that the above arguments also yield the following 
more precise version of Proposition 16.121 the weak global Lie model theorem. This builds upon a 
previous result in this direction by Hrushovski: see [SH Theorem 4.2] and the discussion before [32l 
Lemma 4.9]. 

Theorem 10.10 (Strong global Lie Model Theorem). Suppose that A is a global ultra K -approximate 
group. Then there is a large ultra approximate subgroup A o/A™ for some standard m ^ 1 which admits 
a global model tt : (A) — >■ L into a connected, simply connected nilpotent Lie group L of dimension at 
most 6 log2 K . 

Furthermore, there exists a large ultra approximate group A! of A which admits a global model 
tt' : (A') — > L' , a connected nilpotent Lie group, whose maximal (central) compact subgroup N verifies 
L'/N ~ L. 
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11. Applications to growth in groups and geometry 

In this section we collect a variety of applications of our main results, in particular proving the 
various results stated in the introduction. 

As an application of his method Hrushovski [32] established the following strengthening of Gromov's 
theorem on groups with polynomial growth. 

Theorem 11.1. Let G be a finitely generated group and let K ^ 1. Suppose G = lJii>i ^n, where An 
is an increasing union of finite subsets of G such that {A^l ^ if |^„| for all n ^ 1. Then G is virtually 
nilpotent. 

This is indeed a strengthening of Gromov's theorem because if G has polynomial growth with respect 
to some generating set S then the An may be taken to be some subsequence of the word metric balls 
relative to S. 

Unsurprisingly, our main theorem also admits an application of this kind. The following is a corollary 
of Theorem 12. 101 and subsumes Theorem 111.11 above . 

Corollary 11.2 (Gromov-type theorem). Let K ^ 1. Then there is some K' , depending on I'C , such 
that the following holds. Assume G is a group generated by a finite symmetric set S containing the 
identity. Let A be a finite subset of G such that ^ K\A\ and C A. Then there is a finite 
normal subgroup N <1 G and a .subgroup Gi ^ G containing N such that 

(i) Gi has index Ox(l) in G; 

(ii) Gi/N has step and rank Ok(1). 

In particular G is virtually nilpotent. 

Proof of Corollary \11.2\ First we make the following simple observation. Suppose G is a group gener- 
ated by a finite symmetric set S and let Gq be a subgroup of index n = [G : Gq] . Then for every k < n 
the ball S'' meets at least fc + 1 different left cosets of Go in G. Indeed if not then by the pigeonhole 
principle we have 5* Go = 5*+^ Go for some i < k, and so by multiplying on the left with S it follows 
that S'^Go = S^+^Go- Muhiplying on the left by further copies of S implies that S^Gq = {S)Ga G, 
and so Gq has index at most k in G, contrary to assumption. 

Now, we apply Corollarv 11.71 Thus there exists a subgroup Go of G and a normal subgroup H of 
Go such that A may be covered by K' left-translates Go for some K' = Ok{^) depending only on K' , 
and Gq/H is nilpotent of step and rank O^^ (1). In particular. Go is finite-by-nilpotent. 

Using this value of K' , we see by assumption that is contained in A and thus is covered 
by at most K' cosets of Go. From our initial observation we conclude that [G : Go] ^ K' . 

Note that for some s ~ Ok (1) the s-th term of the central descending series G''(Go) is contained 
in H. Moreover, Gi Plggcff^of"^ ^ normal subgroup of G with index at most Oxi^) contained 
in Gq. Hence N :— G*(Gi) is a normal subgroup of G contained in H. On the other hand, Gi/N is 
nilpotent of complexity bounded in terms of K only and it has index Ox(l) in G/N. 

To conclude from this that G is virtually nilpotent, it suffices to show that Gi is. However Gi is 
actually finite-by-nilpotent (the finite group being N) and any such group is virtually nilpotent. To see 
this note that the kernel of the action by conjugation on is a nilpotent subgroup of finite index. □ 
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Remark 11.3. Recall that the condition ^ K\A\ implies the existence of an approximate group Z 
of size 0{K'="-^^A\) and of 0{K'="-^^) left translates of Z which cover A (see [STl Theorem 4.6]). Using 
Remark ll.9l and Theorem l2.121 we then see that Gi can be taken so that Gi/N is 0(log ii')-nilpotent in 
the sense that it admits a generating set mi, . . . , with £ — 0(log K) such that [ui, Uj] e (uj+i, ■ • ■ , Ui) 
for all i < j. In particular such a group admits a normal series with cyclic factors of length at most 
O(logif). 

Remark 11.4. If one assumes that A is a approximate group instead of the doubling condition 
|v4^| ^ K\A\ in Corollary II 1.21 then we may also conclude from Theorem 11.61 that N and a generating 
set of Gi are contained in A*. Using Theorem l2.12l we see that if one additionally wishes to ensure the 
logarithmic bound as in the previous remark, then one can only guarantee that N lies inside 

The following corollary is reminiscent of Gromov's theorem but it involves a weaker type of poly- 
nomial growth condition in which the generating set may be arbitrarily large. Furthermore it only 
requires that at one scale. 

Corollary 11.5. Let d > 0. Then there is R{d) > such that the following holds. Suppose that G is 
generated by a finite symmetric set S and that there is some scale r > R{d) such that \S^\ ^ r'^|S'|. 
Then there is a finite normal subgroup N <] G and a subgroup Gi ^ G containing N such that 

(i) N CS'f 

(ii) Gi has index Od(l) in G; 

(iii) Gi/N is 0{d)-nilpotent [see Remark ] 1.9\ for a definition). 

Proof. Our assumption is that ^ r'^jS'l. Let K = 2 ■ and Ck be such that, in the last part of 
Remark 1 11.4[ N lies in A'-''^ . We claim that there is some tq, ^/r ^ ^ r/2GK, such that A := S^" 
has \A^\ < lO'^l^l. Note that is then a K-approximate group with K ~ 2 ■ lO'^ (see Lemma [??^ . 
Applying Corollarv 111.21 and Remark 111.41 and ensuring that R{d) is so large that R{d) > [K'Y {K' 
being the quantity in Corollary lll.2p , we obtain a finite normal subgroup N <l G and a subgroup 
Gi ^ G containing TV such that Gi has index 0(j(l) in G and Gi/N is 0{\ogK) = 0(d)-nilpotent. 
Furthermore N and a set of generators for Gi are contained in A^*-^^ = S"^"^"^'"^ C S^. 

It remains to justify the claim. If it is false then jS*^'^^^! > 10''|S'^'^| whenever b'^^/r < r/lOCx, 
and in particular \S'^\ ^ (^lQ<iy°s^(Vr/wCK)-i^g\/r^^ jj- j. jg greater than some absolute constant, this is 
greater than r'^jS'l, contrary to assumption. □ 

Remark 11.6. Note that there is no bound on the size of TV. Indeed, if G is a large finite simple group 
and S ~ G then N must equal G, which shows that | can be arbitrarily large compared to d, r. 

In [50] Y. Shalom and the third author gave a quantitative refinement of Gromov's theorem inspired 
by Kleiner's recent new proof (see also [37l Corollary 4.2] for an earlier result in that direction). A 
consequence of their result is that a polynomial growth condition at one large scale is enough to 
guarantee virtual nilpotence. We take the opportunity to record that this follows easily from Corollary 
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Corollary 11.7. Let d > 0. Then there is R{d) > such that the following holds. Suppose that G 
is generated by a finite symmetric set S containing the identity and that there is some scale r > R{d) 
such that |S'''| ^ r'*. Then G contains G' , where 

(i) G" has index Od{{r'^)l) in G; 

(ii) G" is nilpotent with step 0^(1) • 

Proof. We apply Corollary 111.51 to obtain groups A^, Gi with the properties stated there. As N is 
contained in S**", it has cardinality at most r'^. The group Gi acts on N by conjugation; since the 
permutation group of N has cardinality at most (r"*)!, we conclude that the stabiliser G' of this action 
has index at most {r'^)\ in Gi. As Gi/N is nilpotent of step Od(l), we conclude that G' is nilpotent of 
step Od{l) + 1 = Od{l), and the claim follows. □ 

Remark 11.8. As observed in the last section of Gromov's original paper [26], Gromov's theorem on 
polynomial growth already easily implies a weaker result of this kind in which the hypothesis is that 
\S^\ ^ r'^ for all r ~ 1,2, ... , R{d). Note that this result of Gromov (and, a fortiori. Corollary [TTT7]) 
have content even when the group G is finite. Another weakening of the above result appears in [5 7) , 
where |5'''| ^ r'' is assumed for infinitely many r rather than for all r. 

Corollary [TTT7| is stronger than the results in [371 ISS] in the sense that the bounds do not depend on 
the cardinality IS*] of S. On the other hand, the results in [37l[50], which follow a strategy close to that 
of Kleiner's work [36], yield more effective quantitative control on the index and step of G', especially 
in the case when S is of bounded cardinality. 

Another consequence of our main theorem is that polynomial growth in the sense of Corollarv lll.5l 
at one large scale implies polynomial growth at all subsequent scales. 

Corollary 11.9. Let d > 0. Then there is R'{d) > such that the following holds. Suppose that 
G is generated by a finite symmetric set S and that \S^\ ^ r'^\S\ for some r ^ R'{d). Then {S"^ \ ^ 

(/)Od(l)|5| ^1 ^ 

Proof. A simple modification of the proof of Corollarv 1 1 1 . 51 shows that there is some ro, Vr ^ ro ^ r/6, 
such that |S'5''«| < K\S''''\ where K = lOO'^ (say). Applying Corollary [Ullj with A 5''" (as before) 
we obtain a normal subgroup H C 3"^^° such that G / H is virtually nilpotent with the index, step and 
number of generators of the nilpotent subgroup Gi/H all being 0^(1). 

Now by Corollarv 15.21 — S'^''" is a 2_R'-approximate group. This means that there is some set 

|A:| 2K, such that 3^''° C X3^''°. From this it follows that 

(11.1) S^"'"'" C x"'S^''° 

for every positive integer m. 

Let TT : G — >■ G/H be the quotient homomorphism. We have 

(11.2) \s2mro^ ^ |i/||7r(52™''°)|, 

since the cardinality of any fibre is at most \H\. 

From (jll.ip and the fact that vr is a homomorphism we have 

(11.3) 7r(S'2"'-«) C 7r(X)"7r(S'2'-«). 
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On the other hand, since H C S'^''", we have 



(11.4) 



\H\\tt{S^'-°)\ < IS*"'"]. 



Moreover, since ^/r ^ vq ^ r /6, we have 



(11.5) 



Putting (fTr2]) . ((TO)) . (pTil and (pTS)) together gives 



(11.6) 



Now 7r(X) is a set of size Od{l), contained in a virtually nilpotent group in which the index and step of 
the nilpotent subgroup are 0^(1). Every such group is a quotient of one fixed virtually nilpotent group 
with number of generators, index and step of the nilpotent subgroup also Od{l) and whose generators 
are lifts of the elements in Tr{X). Hence there is a bound of the form 



whenever r' is a multiple 2mro with m > 1. It is not hard to see that the same estimate therefore 



A consequence/reformulation of the preceding result is the following. 

Corollary 11.10. Let a > 0. Then there are tq € N and (3 > with limQ_j.o /3(a) — such that the 
following holds. Let G be a finite group generated by a symmetric set S and assume that the diameter 
of the associated Cayley graph satisfies diam5(G') ^ (|G'|/|S'|)". Then \S'^\ ^ min{r^/'^|S'|, |G|} if 
r ^ ro(a). 

Proof. If this docs not hold for some r, tq and /3 then as soon as ro is large enough (in terms of /?) 
Corollary [TL9] applies and yields jS""! ^ 'n'^\S\ for a\\ n ^ r and some e = e(/3) > 0. In particular, 
when n reaches the diameter of G, we obtain 5" = G so \G\ ^ (diam5(G))'^|S'|. This contradicts our 
hypothesis if e < 1/a. □ 

We shall apply CoroUarv 1 1 1 . 101 later on to deduce an isoperimetric inequality; see Corollarv lll.151 
Finally we show that by repeatedly applying Corollarv lll.2l we can obtain the following more precise 
result, which says something non trivial for finite groups as well. We say that a polycyclic group has 
length at most L if it is obtained from the trivial group by at most L successive extensions by a cyclic 
group. 

Corollary 11.11. Let G be a group which has a left-invariant metric d : G x G ^ [0, oo) satisfying 
the following conditions for some K ^ 1: 

(i) (Uniform doubling property) We have \B(2r)\ ^ _ftr|B(r)| for every r > 0; 



\7T{Xr\ m' 



Od(l) 




holds for all r'. 



at the expense of increasing the exponent Orf(l) if necessary. 



□ 
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(ii) (Finiteness condition) There are at most K different subgroups of the form {B{r)) as r ranges 
over (0, oo). 

Then G has a subgroup of index at most 0^(1) which is poly cyclic of length O/f (1). 

Proof. Given d S N and i? ^ we claim that if there are at most d groups of the form {B{r)) for r ^ R, 
then {B{R)) contains a polycychc subgroup of index OK,d{i)- This is clearly enough to establish the 
corollary. To prove the claim, we proceed by induction on d. It is clear for d — 1, since {B{R)) is then 
the trivial group. 

Let Rq be the upper bound of those R' ^ such that there are at most d — 1 groups of the form 
{B{r)) for r < R' . Without loss of generalit50 < Rq R. Then (B(r)) = {B{R)) whenever 
Rq ^ r ^ R. By the induction hypothesis, {B{Rq/2)) contains a polycyclic subgroup P of index 
OkA^) and length OkA^)- 

Let K' = Ok(1) be the constant obtained in CoroUarv fTT^ Setting S = B{Rq) and A ^ B{K'Ra), 
we may apply CoroUar v 111.21 and conclude that G = {B{Ro)) contains a subgroup Gi of index Ok{^) 
such that Gi has a normal subgroup N C B{4:K'Ro) with Gi/N nilpotent with step and number of 
generators Oi<-(l). It is enough to show that Gi has a polycyclic subgroup of index OkA^)^ because 
then so wiU G = {B{R)). 

By the uniform doubling assumption and a covering argument, B{AK' Rq) can be covered by Ox(l) 
translates of B{Rq/2). It follows that N can be covered by OkA^) translates of P, and in particular 
[N : N \^P] — OK.d{^)- Now n P is a subgroup of P and hence is also polycyclic of length OK.d{^)\ 
in particular, it is generated by OkA^) elements. Therefore so is A^, and hence N^, the intersection of 
all subgroups of N of index at most [N : N f^P], has index OkA^) i'^ ^- (To see this recall Schreier's 
theorem that if 5' is a symmetric generating set for a group F, and if F' ^ F has index fc, then S^^~^ 
contains a set of generators for F'.) 

The group A'o, being a subgroup of iV n P, is polycyclic. It is also characteristic in N and hence, 
since N is normal in Gi, A'o is also normal in Gi. 

However Gi acts by conjugation N/Nq, and the kernel of this action is a subgroup G'^ of Gi 
with index OkA^)- Now {N D G'i)/Nq is central in G'i/Nq and of size OkA^)- We thus have 
Nq ^ N O G'l ^ G'l, where each successive quotient is polycyclic of length OkA^)- I* follows that G'l 
is polycyclic of length OA'.ci(l), which is what we wanted to establish. □ 

Remark 11.12. There are examples of groups which satisfy the assumptions of Corollarv 111.111 vet have 
no nilpotent subgroup of index 0^(1). For instance, let p be a large prime and set G (Z/pZ)^ x: Z, 
where the action is by an element of SL2(Z/pZ) which is a diagonal matrix 7 of the form 7 := 
diag(a;, where a; e F* is a generator of the multiplicative group of Fp. Then no subgroup of G of 

index less than 1 is nilpotent (note that such a subgroup must contain (Z/pZ)^ and be the preimage 
of the subgroup of Z with that index). However we can endow G with a uniformly doubling weighted 
word metric (with 3 generators) by letting the two standard generators of (Z/pZ)^ each have weight ^ 
and 7 have weight 1. 

We turn now to some geometric applications of the above results. 

l^Recall that B{R) is the closed ball {g e G; d{l, g) ^ _R}. 
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Manifolds with a lower bound on Ricci Curvature. A. Petrunin suggested to us some years 
that a result such as Corohary 111.51 would give a purely group-theoretical proof of a theorem of 
Fukaya and Yamaguchi ISi according to which fundamental groups of almost non-negatively curved 
manifolds are virtually nilpotent. Recall that a closed manifold M is said to be almost non-negatively 
curved if one can find a sequence of Riemannian metrics on it for which diam(Af) ^ 1 while /^Tm ^ ~l/n 
where Km is the sectional curvature. Indeed, a simple application of the Bishop-Gromov inequalities 
combined with Corollary [TT3] yields the following improvement assuming only a condition on the Ricci 
curvature. 

Corollary 11.13 (Ricci gap). Given d G N, there is e{d) > such that the following holds. Let M be an 
d- dimensional compact Riemannian manifold with Ricci curvature bounded below by —e and diameter 
at most 1. Then i:i{M) has normal subgroup of index 0(i(l), which is finite-by- (0(d) -nilpotent). In 
particular tti (M) is virtually nilpotent. 

Proof. Fix a base point xq on the universal cover M and let be a Dirichlet fundamental domain 
based at xq for the action of F 7ri(M): that is, 

J- := {p £ M : d{xQ,p) ^ d(7 • xq,p) for aU 7 G F}. 

Set 5 := {7 € F : ^(7 • a;o,a;o) < 3}. Note that diam(J') < 1 and that S is symmetric and contains 
1. Observe further that S generates F and that for every integer r ^ 1 we have B{xo,r) C S**" • C 
B{xo, 3r + 1), where B{xQ,r) is the ball of radius r on M for the Riemannian metric lifted from M. It 
follows that 



1^1 \B{xo,3r + l)\ 
^ ' ' \S\ ^ \B{xoA)\ • 

From the assumed Ricci curvature bound and the Bishop-Gromov volume comparison estimates 
(see [ini Theorem 4.19]) we have the bound 

\B{xo,r)\ ^ |B_,(r)| 



|B(xo,l)| ^ |i?-.(l)l' 

where B_^{r) is a metric ball in the comparison model space with constant curvature —e and dimension 
d. The volume of this ball is \B^,{r)\ = ^-^\B^i{r / = Cd {?^2H^y~^dt, where > is 
the volume of the d — 1-dimensional unit sphere (see [161 p. 138] for this volume computation). As e 
tends to 0, this tends to Cdr'^/d. Combining this with (|11.7p we obtain that for every Rq ^ 1 there is 
some eo = eo('^, ^0) such that 

^^2(3^ + 1)-^ 

for all r ^ Rq provided that < e < eo- Letting Rq ~ i?o(2d) be as in Corollarv lll.51 we obtain the 
existence of some s — e{d) > for which the conclusion of that statement holds. This completes the 
proof. □ 



'See also |http : //mathoverf low . net/quest ions/11091 [ 
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Remark 11.14. The fact that 7ri(M) is virtually nilpotent under the above Ricci bounds assumptions 
was obtained by Cheeger and Colding in }10| (and had been conjectured earlier by Gromov) and their 
proof was recently completed and extended by Kapovitch and Wilking [35 , who also established that 
the index of the nilpotent subgroup is uniformly bounded by a constant depending on the dimension 
d only, an improvement which seems beyond the scope of our methods. This extended earlier work of 
Kapovitch, Petrunin and Tuschmann in |34j which proved the same result under sectional curvature 
bounds instead of Ricci. The work of these authors is, unlike our work, differential-geometric in nature. 
The linear dependence in d of the nilpotency length proven in our CoroUarv 111.131 seems new however. 

An isoperimetric inequality. It has been well-known since the work of Varopoulos on Kesten's 
conjecture ([Ml [41]) that isoperimetric inequalities on Cay ley graphs are closely related to lower bounds 
on the volume growth. Using this idea and Corollary Ill.lOi we can derive the following property of 
finite Cayley graphs with a polynomial upper bound on the diameter. 

Corollary 11.15 (Isoperimetric inequality on finite groups). Let a > 0. Then there are rp G N and 
/3 > with limQ,_>o /^(q^) — such that the following holds. Let G be a finite group generated by a 
symmetric set S and assume that the diameter of the associated Cayley graph satisfies diam5(G) ^ 
(|G|/|S'|)". Then for every subset E mG with \ra \E\ ^ \\G\, \dE\ ^ \\S\l^\E\^-f^ . 

Proof. This follows almost immediately from Corollary 111.101 and the following well-known lemma, 
which may be found in [27, Chapter 5] or 41 and references therein. For the convenience of the reader 
we offer a self-contained proof. □ 

Lemma 11.16 (Isoperimetry versus growth). Let G be group and S some finite symmetric generating 
set containing 1. Let B{r) — S'' be the word ball of radius r in the word metric. Let dE = SE \ E be 
the boundary of a subset E <Z G. If E Q G is a set, write r{E) for the infimum of those r for which 
\B{r)\ ^ 2\E\. Then for all E with \E\ < \G\/2 we have \E\ < 4:r{E)\dE\. 

Proof. We give a proof for the reader's convenience. Let f = 1e the indicator function of the set E, 
and fr := \B{r)\ '^geB{r) 3 ' / tie the average of / over balls of radius r. By the triangle inequality we 
have IIS'/ — /111 \g\' max^gg ■ / — / ||i, where \g\ is the distance to the identity in the word metric. 
Moreover | |s • / - /| |i = |s£' A ^ 2|5£;| for every s € S. Hence \\fr ~ f\\i ^ 2r\dE\. On the other 
hand for every x G E, there are at most \E\ elements g E B{r) such that g ■ 1e{x) 7^ 0. Therefore if 
\B{r)\ ^ 2|£;| then fr{x) i and hence ||/^ - /jji ^ i|£'|. The claim follows. □ 

In [1], Benjamini and Kozma conjecture that one can take /3 = a in the Corollary 111.151 (at the 
expense of introducing a possible multiplicative constant in place of 15*1^/8 in (ii)). This, however, 
is beyond the scope of our method. We would like to thank Itai Benjamini for drawing our attention 
to their work and its connection to Gromov-type theorems. 

A GENERALIZED Margulis LEMMA. In hyperbohc geometry, the Marguhs lemma asserts that there 
is a constant e — e{n) > 0, the Margulis constant, such for any discrete subgroup F of isometrics of the 
hyperbolic n-space H", and any point x € H", the almost stabiliser Fe(a;) := {7 G F : d{'j ■ x,x) < e} is 
virtually cyclic. This lemma is important for describing the geometry of cusps in hyperbolic manifolds. 
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or for establishing volunie lower bounds (see e.g. |54j). Various generalisations of this lemma have 
been established in the past for more general Riemannian manifolds under curvature upper and lower 
bounds (e.g. [HI chap. 6]). Typically in these results, unless the manifold has strictly negative curvature, 
"virtually cyclic" in the conclusion of the lemma must be replaced by "virtually nilpotent" . 

In [27l §5.F] Gromov raises the issue of establishing a generalized Margulis lemma under very weak 
assumptions on the metric space and he proposes a conjectural statement in this direction. Below we 
answer Gromov's question affirmatively. 

A metric space X is said to have hounded packing with packing constant K if there is K > Q such 
that every ball of radius 4 in X can be covered by at most K balls of radius 1. Say that a subgroup F 
of isometrics of X acts discretely on X if every orbit is discrete in the sense that {7 e F : 7 • a; € S} is 
finite for every x £ X and for every bounded set E C X. 

Corollary 11.17 (Generalized Margulis Lemma). Let K ^ 1 be a parameter. Then there is some 
e{K) > such that the following is true. Suppose that X is a metric space with packing constant K , 
and that T is a subgroup 0/ isometrics of X which acts discretely. Then for every x £ X the "almost 
stabiliser" Ti;{x) — {Se{x)), where S^ix) := {7 € F : 1^(7 ■ x,x) < e}, is virtually nilpotent. 

Proof. Each set Sr{x) is symmetric and contains the identity. Now by the assumption on X the ball 
B{x, 4) can be covered by collection of balls B{xi, 1), i = 1,2, . . . , K . Suppose that for i = 1, 2, . . . , fc 
there is at least one element 7^ S S4{x) with 7^ • a; e B{xi, 1). Suppose now that 7 e 5'4(x) is arbitrary; 
then there is some i € {1, 2, . . . , fc} such that 7-x e B{xi, 1). But this means that d{^-x, ^i-x) < 2, and 
therefore 7j^^7 £ 5*2(2;). This implies that 6*4(0;) C lJi=i 7j'^2(a;), which yields (since S2{x)'^ C Si{x)) 
the doubling estimate |5'2(a;)^| ^ K\S2{x)\. 

Let K' = K'{K) > be the constant from CoronaryHTl Set e 2/K', S = S,{x) and A = 6*2(2;). 
A direct application of Corollarv 111.21 shows that l^e{x) — (5) is virtually nilpotent. □ 

Remark 11.18. This confirms Gromov's conjecture, which suggested the same conclusion under the 
slightly stronger hypotheses that every ball of radius R'm X can be covered by at most C{R/r)™ balls 
of radius r for all < r < i? ^ 1 and some fixed constants C, m > 0. 

The assumptions of this generalized Margulis lemma are satisfied for example if X is a complete 
Riemannian manifold with a lower bound on its Ricci curvature, by an immediate application of the 
Bishop-Gromov volume comparison estimates. In this case, the result was proved by Cheeger-Colding 
[To] and Kapovitch-Wilking [35], namely: 

Corollary 11.19. Let d ^ I be an integer. Then there is e = e{d) > with the following property. 
Suppose that M is a d- dimensional complete Riemannian manifold with a Ricci curvature lower bound 
Ric ^ — (d — 1) and that T is a subgroup o/Isom(M) which acts properly dis continuously by isometrics 
on M. Then for every x £ M the "almost stabliser" Fe(a;) := {7 G F : d{'j ■ x,x) < e} is virtually 
nilpotent. 

In fact the result in [35l Theorem 1] is a stronger version of Corollarv lll.l9[ establishing that T^{x) 
has a nilpotent subgroup of index 0^(1). This stronger result seems to be beyond the scope of our 
method. 
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We also note that Corollary 111.111 applies to the Margulis lemma in the context of Riemannian 
d-manifolds with a lower bound on sectional curvature, because then the Gromov short basis has 
bounded cardinality from Toponogov's theorem (see for instance [U 37.3]). We thus get this way an 
alternate proof of the Fukaya-Yamaguchi theorem [15] according to which almost non-negatively curved 
n-manifolds have 0„(l)-virtually polycyclic fundamental group. Again, by ^4] we know better, namely 
that they are 0„(l)-virtually nilpotent, but once again this seems beyond the scope of our method. 

Finally we would like to remark that the usual proofs of the classical Margulis lemma bear some 
resemblance to the proof of our main theorem in as much as they use a similar "shrinking commutator 
trick" to establish nilpotence. While we proved this shrinking commutator estimate for the escape norm 
associated to an approximate group as part of the Gleason lemmas (Theorem 18. ip . in the Margulis 
lemma, one proves a similar estimate for the norm ||7]|a; — d{'-f ■ x,x) by a riemannian geometric 
argument using the assumed curvature bounds. This "shrinking commutator trick" dates back at least 
to Bieberbach |2j in his proof of Jordan's theorem on finite linear groups. 



Appendix A. Basic theory of ultralimits and ultraproducts 

In this appendix we review the machinery of ultralimits and ultraproducts. We will borrow some 
terminology from nonstandard analysis in order to do this, although we will not rely too heavily on 
nonstandard machinery in this paper. 

We will assume the existence of a standard universe il which contains all the objects and spaces 
that one is interested in (such as the natural numbers N, the real numbers R, the classical Lie groups, 
etc.). The precise construction of this universe is not particularly important for our purposes, so long 
as it forms a set. We refer to objects and spaces inside the standard universe as standard objects and 
standard spaces, with the latter being sets whose elements are in the former category. 

We will rely heavily on the existence of a nonprincipal ultrafilter. 

Lemma A.l (Ultrafilter lemma). There exists a collection a of subsets of the natural numbers N with 
the following properties: 

(i) (Monotonicity) If A & a and B Z) A, then B E a. 

(ii) (Closure under intersection) If A, B £ a, then AO B £ a. 

(in) (Maximality) If A C N, then either A e a or N\A e a, but not both. 

(iv) (Non-principality) If A Cz a, and A' is formed from A by adding or deleting finitely many 
elements to or from A, then A' €z a. 

We refer to a collection a obeying the above axioms as a nonprincipal ultrafilter. 
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Proof. The collection of cofinite subsets of N already obeys the monotonicity, closure under intersection, 
and non-principality properties. Using Zorn's lemma^i. one can enlarge this collection to a maximal 
collection which, it may be verified, has all the required properties. □ 

Throughout the paper, we fix a non-principal ultrafiltcr a. A property P(n) depending on a natural 
number n is said to hold for n sufficiently close to a if the set of n for which P(n) holds lies in a. A 
set of natural numbers lying in a will also be called an a-large set. 

Once we have fixed this ultrafilter, we can define nonstandard objects and spaces. 

Definition A. 2 (Nonstandard objects). Given a sequence (a;n)ngN of standard objects in il, we define 
their ultralimit \\n\n^a to be the equivalence class of all sequences (yn)neN of standard objects in il 
such that — for n sufficiently close to a. Note that the ultralimit limn_>.Q can also be defined 
even if x^ is only defined for n sufficiently close to a. 

An ultralimit of standard natural numbers is known as a nonstandard natural number, an ultralimit 
of standard real numbers is known as a nonstandard real number, and so on. 

For any standard object x, we identify x with its own ultralimit limn-i-a x. Thus, every standard 
natural number is a nonstandard natural number, etc. 

Any operation or relation on standard objects can be extended to nonstandard objects in the obvious 
manner. Indeed, if O is a fc-ary operation, we define 

Oi lim x]., . . . , lim x^.) :— lim 0{x\, . . . , x^.) 

n— >-a n— >Q n— fa 

and if i? is a fc-ary relation, we define i?(limn-i.a x\, . . . , limn-i-a x^) to be true iff R{x\, . . . , x^) is true 
for all n sufficiently close to a. One easily verifies that these nonstandard extensions of O and R are 
well-defined. 

Example 23. The sum of two nonstandard real numbers limrn-a Xn, limn^a J/n is the nonstandard real 
number 

lim Xn + lim y^ — lim Xn + Vm 

n— n— >-a n— >a 

and the statement livun^a Xn < linin-i-Q Vn means that x^ < y-a for all n sufficiently close to a. 

Definition A. 3 (Ultraproducts) . Let (Arn)neN be a sequence of standard spaces Xn in il indexed by 
the natural numbers. The ultraproduct Ilti^Q defined to be the space of all ultralimits 

limn-i-Q Xn, where Xn G Xn for all n. We refer to the ultraproduct of standard sets as an nonstandard 
set; in a similar vein, an ultraproduct of standard groups is a nonstandard group, and an ultraproduct 
of standard finite sets is a nonstandard finite set. We refer to * X :— J^„_yQ, X as the ultrapower of a 
standard set X; the identification of x with limn-i-Q x causes X to be identified with a subset of * X. 
We will refer to the ultrapower *il of the standard universe il as the nonstandard universe. 

^'^By using this lemma, our results thus rely on the axiom of choice, which we will of course assume throughout this paper. 
On the other hand, it is possible to rephrase the purely combinatorial results in this paper, such as Theorem l2.10l in the 
language of Peano arithmetic. Applying a famous theorem of Godel |21l . we then conclude that Theorem l2.10l is provable 
in ZFC if and only if it is provable in ZF. In fact it is possible, with significant effort, to directly translate these ultrafilter 
arguments to a much lengthier argument in which neither ultrafilters nor the axiom of choice are used. However, this 
would require one to "finitise" or "proof- mine" such infinitary results as the Heine-Borel theorem or Theorem IB. 181 and 
this in turn would require finitisations of the construction of Haar measure and the Peter- Weyl theorem. This would 
lead to a vastly messier argument. 
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Remark A. 4. Nonstandard sets in nonstandard analysis behave analogously in some ways to measurable 
set^ in measure theory; for instance, the union or intersection of two nonstandard sets is again a 
nonstandard set. Also, just as a subset of a measurable set need not be measurable, a subset of a 
nonstandard set need not be another nonstandard set. For instance, the nonstandard natural numbers 
*N is a nonstandard set (being the ultraproduct of the sequence N, N, . . .), but the standard natural 
numbers N, despite being a subset of *N, is not a nonstandard set. 

A fundamental property of ultralimits is that they preserve first-order statements and predicates, 
a fact known as Los 's theorem. Here is one formalisation of this theorem. 

Theorem A. 5 (Los's theorem with parameters). Let m be a standard natural number, and for each 
1 ^ i ^ m, let Xi — livun^aXi^n be a nonstandard object. If P{yi, . . . ,y„i) is a predicate, then 
P{xi, . . . ,Xm) is true {as quantified over the nonstandard universe if and only if P{xi^n, ■ ■ ■ ,Xm,n) 
is true for all n sufficiently close to a {as quantified over the standard universe il). 

Proof. (Sketch) By definition, Los's theorem is true for "primitive" predicates which take the form 
R{xi,...,Xk) for some primitive /c-ary relation R and objects xi, . . . ,Xk, or of the form Xk+i — 
0{xi, . . . ,Xk) for some primitive fc-ary operator O. From the ultrafilter axioms, we also see that 
Los's theorem is closed with respect to boolean operations; for instance, if Theorem IA.6I holds for 
P{xi, . . . , Xm) and Q{xi, . . . , Xm), then it also holds for ^P or P AQ. 

Now, we claim that if Los's theorem holds for the predicate P{xi , . . . , Xm), then it also holds for the 
quantified predicates 3xm ■ P{xi, . . . , Xm) and ^Xm ■ P{xi, . . . , Xm) (where now there are only m — 1 
free variables xi, . . . ,Xm-i, with Xm being bound). We show this just for the existential quantifier 3, 
as the case of the universal quantifier V is similar (and can be deduced from the existential case by 
negation). Suppose first that 3xm '■ P{xi, . . . , Xm) is true in Then there exists Xm = ^^"cun^a Xm,n 
such that P{xi, . . . , Xm) holds; by hypothesis, this implies that P{xi,m . . . , Xra,n) holds for n sufficiently 
close to a, and thus 3xm '■ P{xi^n, ■ ■ ■ ,Xm-i.n,Xm) holds for n in il sufficiently close to a as desired. 
Conversely, if 3xm ■ P{xi^nj ■ • ■ i Xm-i^m Xm) holds in il for n sufficiently close to a, then by the axiom of 
(countable) choice, we may find Xm.n G ii for such n such that P{xi^n, ■ • ■ , Xm-i,n, Xm,n) holds. Setting 
Xjn := limn_>.Q n, we conclude that P{xi, . . . , x^) holds, and the claim follows. 

The above discussion yields Los's theorem for any predicate that can be built out of primitive 
predicates by a finite number of boolean operations and quantifications. However, it is easy to see that 
all predicates are logically equivalent to a predicate of this form. For instance, VaVfoVc : {a + b) + c ^ 
a + {b + c) is equivalent to 

VaV5Vc3d3e3/ : {d = a + b) A {e ^ b + c) A {f ^ d + c) A {f ^ a + e). 

This completes the proof. □ 

In applications, we will actually use a slight generalisation of Los's theorem. 

Theorem A. 6 (Los's theorem with parameters and ultraproducts). Let m,k be standard natural 
numbers. For each 1 ^ i ^ m, let Xi — limn_i.Q Xi^n be a nonstandard object, and for each 1 ^ j ^ k, let 

^"'^ Actually, the notion of an elementary set (e.g. a finite union of intervals) would be an even closer analogy here than 
the notion of a measurable set. 



THE STRUCTURE OF APPROXIMATE GROUPS 



77 



Aj — rin-i-a fee a nonstandard set. If P{yi, . . . , ym] Bi, . . . , Bk) is a predicate over m objects and k 
sets, with the sets Ai , . . . , only appearing in P through the membership predicate x e Bj for various 
j and various objects Bj, then P{xi, . . . , Xm] Ai, . . . , A^) is true (as quantified over the nonstandard 
universe *il) if and only if P{xi^n, • ■ • : Xm,n] ^i,m ■ • ■ : ^fc,n) is true for all n sufficiently close to a {as 
quantified over the standard universe il) . 

Proof. We replace each appearance of a; G Bj in P with a new primitive relation Rj{x,n), which 
is interpreted in il as x e Aj^„. This replaces the predicate P{yi, . . . , y-m] Bi, . . . , Bk) by a predicate 
QiVi, ■ ■■ , Urn, n), with P{ ; Ai^n, • ■ • , Ak,n) logically equivalent to Q{xi^„, . .. , Xm,n, n). One 

easily verifies that P(xi, . . . , Xm', Ai, . . . , Ak) is logically equivalent to Q{xi, . . . , Xm, linin-i-Q n), and the 
claim now follows from Theorem IA.5I □ 

Example 24. Any ultraproduct G := Hn-s-a groups Gn is again a group, because one can write 
the property of G being a group as a predicate P{G) that involves membership in G (as well as the 
constant id and the group operations •, ()~^, of course). Conversely, if G = Hn-i-a ^ group, then 

Gn is a group for all n sufficiently close to a. 

Example 25. Let G = Iln-^Q ultraproduct of groups (and thus also a group), and let A = 

rin-i-a ^^'^ ^ ~ rin^Q subsets of G that are nonstandard sets. Then, for n sufficiently close to 

a, An and Bn are subsets of G,, and Bn (because this statement can be written as a predicate involving 
membership in AmBn,Ga). In a similar (but more complicated) spirit, for any standard JsT G N, A 
can be covered by K left-translates of B if and only if, for n sufficiently close to a, A^ can be covered 
by K left-translates of B^. 

A nonstandard real number a; S *M is said to be bounded if one has ^ G for some standard 
G > 0, and unbounded otherwise. Similarly, we say that x is infinitesimal if |a;| ^ c for all standard 
c > 0; in the former case we write x = 0(1), and in the latter x = o(l). For every bounded real 
number x € *M there is a unique standard real number st(x) G M, called the standard part of R, such 
that X = st{x) + o(l), or equivalently that st(a;) — e ^ x ^ st(a;) + e for all standard e > 0. Indeed, 
one can set st(a;) to be the supremum of all the real numbers y such that x > y (or equivalently, the 
infimum of all the real numbers y such that x < y). We write X ~ 0{Y), X Y , or Y ^ X ii we 
have X ^ CY for some standard G. 

Given a sequence /„ : X^ — > Y^ of standard functions between standard sets Xn,Yn, one can 
form the ultralimit f limn_j.Q, /„, which is a function from the ultraproduct X :— Yin^a 
ultraproduct Y Hn-fQ defined by the formula 

/(lini Xn) := lim fnixn). 

n— >a n— >-a 

Such ultralimits will be called nonstandard functions (and are also known as internal functions in the 
nonstandard analysis literature). In particular, since standard finite sequences (an)^^=i of standard 
reals a„ e M with some standard length N £ N can be viewed as a function n a„ from {1, . . . , N} 
to R, one can thus define nonstandard finite sequences {an)n=i of nonstandard reals a„ € *M with 
some nonstandard length G *N as an ultralimit of standard finite sequences (an„,n)^"=i, thus 
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TV = linin^a and 

n— 

One can then transplant various operations on standard finite sequences to their nonstandard counter- 
parts, and can in particular define the sum 

N 
n=l 

of a nonstandard finite sequence {an)n=i = l™n-j-Q(ann,n)^"=i by the formula 

N N„ 

Ea„ := lim a„„,n- 

n—1 rin— 1 

Appendix B. Local groups 

In this appendix we recall the basic definitions and notations of (symmetric) local group theory, 
following Goldbring |23) . 

Definition B.l (Local group). A symmetric local group G = (G, id, •, ()^^) is a topological space G 
with a distinguished element id G G (the identity element), together with a globally defined inversion 
map : G ^ G and a partially defined product map • : 17 — > G, obeying the following axioms: 

(i) (Partial closure) is an open neighbourhood of (G x {!}) U ({1} x G) in G x G. 

(ii) (Continuity) The maps ()^^ : x !—>■ x^'^ and ■ : {x,y) ^ x ■ y are continuous on G and Q, 
respectively. 

(iii) (Local associativity) li g,h,k G G are such that {g ■ h) ■ k and g ■ (h ■ k) are well-defined (thus 
{g, h), {g ■ h, k), (h, k), {g, h ■ k) all lie in fi), then (g ■ h) ■ k — g ■ {h ■ k). 

(iv) (Identity) For any g Cz G, one has id ■ g ~ g ■ id ~ g. 

(v) (Invertibility) li g E G, then g ■ g^^ and g^^ ■ g are well-defined (i.e. {g,g~^), {g~^,9) G ^) 
and are equal to id. 

If necessary, we will write id, Q as idc, to reduce confusion. If £7 = G x G, we call G a global group 
or a topological group. 

If G has the structure of a smooth finite-dimensional real manifold, and the inversion map ()^^ and 
product map • are smooth maps, we say that G is a local Lie group. 

Remark B.2. One can also consider non-symmetric local groups, in which the inversion map ()^^ is 
only defined on an open neighbourhood A of the identity. However, the theory of non-symmetric local 
groups contains some minor additional technicalities caused by the existence of non-invertible elements 
which we wish to avoid here. As we will not consider non-symmetric local groups anywhere in this 
paper, we will often omit the adjective "symmetric" from the term "local group" when there is no 
chance of confusion. 

Following |23] , we do not explicitly assume that G is Hausdorff . In practice, though, one can reduce 
to the Hausdorff case because the closure of the identity element will turn out to be a closed normal 
subgroup that one can quotient out by. 
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Example 26. If G is a symmetric local group and [/ is a symmetric open neighbourhood of the identity 
(thus g^^ G U whenever g ^ U), then U can also be viewed as a symmetric local group, by restricting 
the domain ft of the product maps to {{g,h) £ il O {U x U) : g ■ h U} (and also restricting the 
topological structure of G to U). We will sometimes write this symmetric local group as G [u to 
emphasise that it is the restriction of G to U . In particular, an important source of local groups comes 
from restricting a global group to an open symmetric neighbourhood of the identity. 

One can also restrict G to non-open symmetric neighbourhoods of the identity, but the resulting 
object obtained is not necessarily a symmetric local group (see e.g. Example [28l below) . 

We say that two symmetric local groups G, G' are locally identical if they have a common restriction, 
thus there exists a U which is an open symmetric neighbourhood of the identity = Ic in both G and 
G' for which the group operations on G and G', when restricted to U, agree completely (in particular, 
they have the same domain and range). This is an equivalence relation, and we will focus on those 
properties of symmetric local groups that are preserved up to local identity. 

In a similar spirit, we say that two subsets A, B of a, symmetric local group in G are locally identical 
if there exists an open neighbourhood U of the identity in G such that ADU = BDU. For instance, all 
neighbourhoods of the identity are locally identical. Note that every open neighbourhood if the identity 
contains an open symmetric neighbourhood, so we can assume here that U is symmetric without loss 
of generality. 

Remark B.3. Symmetric local groups are defined as topological groups, but if one wishes, one can 
restrict attention to discrete symmetric local groups, in which every set is open. In this case, all 
references to continuity, openness, and the Hausdorff property in Definition IB. II can be omitted as 
being automatically satisfied. On the other hand, all discrete local groups are locally equivalent to the 
trivial local group {id}. 

Example 27. If g is a (finite-dimensional) Lie algebra, and _B is a sufficiently small symmetric open 
neighbourhood of the identity in g, then exp(i3) is a symmetric local group, with the multiplication 
law given by the Baker-Campbell-Hausdorff formula. 

Example 28. The closed interval [—1,1] in M with the addition operation is not a symmetric local 
group, because the set {{x,y) G [—1,1] x [—1,1] : x + y & [—1,1]} is not open in [—1,1] x [—1,1]. 
However, the open interval (—1,1) is a symmetric local group. 

Given any finite number of elements gi, . . . , gm in a global group G, one can use the associativity 
axiom to unambiguously define the product gi . . . gm- In a symmetric local group, one can only define 
this product gi ■ ■ ■ gm locally. We formalise this as a definition: 

Definition B.4 (Finite products). Let gi,. . . ,gm be a finite number of elements in a symmetric local 
group G. We say that the product gi . . . gm is well-defined in G (or well-defined for short) if, for each 
1 ^ i ^ j ^ TO, we can find a group element g[ij] G G with the following properties: 

• For each 1 ^ i ^ rri, we have g[j jj = gi. 

• If 1 ^ i ^ j < fc ^ TO, the product g[ij] ■ gy+Lk] is well-defined (i.e. {g[ij], g[j+i^k]) G ^) and 
equal to g^i^k] ■ 
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By induction we see that if these group elements g[ij] exist, then they are unique. We then define 
gi ■ . ■ Qk '■— 9[i.k] ■ If .91 — 5fc = 9i we abbreviate gi . . .gk as g''. By abuse of notation, we also 

write gi . ■ . gm G G to denote the assertion that gi . . . gm is defined in G. 
We adopt the convention that gi . . . g„i — id when m = 0. 

An easy induction using the local associativity axiom shows that if gi, . . . , gm G G is such that 
gi ■ ■ ■ gj is well-defined whenever 1 ^ i < j ^ m with ^ (l,m), and {gi . . . gj) ■ {gj+i ■ ■ ■ gk) is 

well-defined whenever 1 ^ z ^ j < fc ^ m, then gi . . . gm is well-defined, and we have 

(5* ■ • ■ gk) = (ffj . . . 5j) • {gj+i ...gk) 

for all 1 ^ i ^ j < fc ^ m. 

Remark B.5. It is worth pointing out one subtlety here: in order for gi . . . g^, to be well-defined, it is 
necessary that all possible ways of decomposing this m-fold product into pairwise products be well- 
defined. For instance, for .913233 to be well-defined, both {gi ■ 32) • 53 and gi ■ (32 ■ 33) need to be 
well-defined. Similarly, if 31,32,33,34 are such that 315233, (315233) • 34, 323334, and 31 • (323334) are 
well-defined, this is not yet sufficient to deduce that 31323334 is well-defined, because (3132) • (3334) 
need not be well-defined. For instance, in the (additive) local group {—1, 0, +1}, the expression (+1) + 
(— 1) + (—1) + (+1) is not well-defined, because (—1) + (—1) is not well-defined. 

Related to this is the well-known fact that local associativity does not imply global associativity: 
it is possible for two different ways of decomposing an m-fold product into pairwise products to both 
exist, but give distinct values; see [40] for further discussion. For instance, there exists a local group 
G and elements 31,32,33,34 G G such that ((31 • 32) • 33) • 34 and 31 • (32 • (33 • 34)) both exist, but are 
not equal to one another. Of course, in this case, we do not consider 31323334 to be well-defined. 

Another easy induction also shows that for each m ^ 1, the set of tuples (31, ... ,3™) G G™ for 
which 31 ... 3m is well-defined is an open subset of G™. 

Now we extend the notion of products and inverses from individual group elements to sets of such 
elements. 

Definition B.6. Let G be a symmetric local group. A subset A of G is said to be symmetric if the 
set := {3^^ : g € A} is contained in A. li Ai, . . . , A^ are subsets of G, we say that Ai . . . A^ is 
well-defined in G (or well-defined for short) if 31 . . . g^n is well-defined for all 31 G Ai, . . . , 3,,,. G Am, 
in which case we write Ai...Am := {31 • • • 3m : 3i G ^1, ■ • • , 3m G Am}- If Ai = . . . = A,„ ^ A, 
we abbreviate Ai . . . Am as A™. By abuse of notation, we write Ai . . . Am C G for the assertion that 
Ai . . .Am is well-defined in G. We adopt the convention that Ai . . .Am — {id} when m — 0. In 
particular, A" = {id} for any A C G. 

An easy induction (see j231 Lemma 2.5]) shows that for any local group G and any open neigh- 
bourhood Uq of the identity, there exists a nested sequence Uq Z) Ui Z) U2 . . . of symmetric open 
neighbourhoods of the identity such that Um+i C Um for every m ^ 0, which in particular implies that 
Um is well-defined in Uq, and thus Ai . . . Am is well-defined in Uq whenever Ai, . . . , Am C Um- 
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We make the trivial remark that multiphcation of sets is associative: if . . . Am is weh-defined, 
then for any l^i^j<k^m, {Ai ■ Aj) ■ {Aj+i . . . Ak) and Ai. . .Ak are well-defined and equal to 
each other. 

By passing to neighbourhoods such as Um, one can improve the group-like properties of a local 
group. To illustrate this principle, let us first introduce the following definition. 

Definition B.7 (Cancellative local groups). A symmetric local group G is said to be cancellative if 
the following assertions hold: 

(i) Whenever g,h,k E G are such that gh and gk arc well-defined and equal to each other, then 
h = k. (Note that this implies in particular that {g~^)~^ = g.) 

(ii) Whenever g,h,k E G are such that hg and kg are well-defined and equal to each other, then 
h = k. 

(iii) Whenever g,h G G are such that gh and h~^g~^ are well-defined, then {gh)~^ = h~^g~^. (In 
particular, if [/ C G is symmetric and is well-defined in G for some m > 1, then U"^ is 
also symmetric.) 

Clearly all global groups are cancellative. A local group need not be cancellative everywhere; 
however, we can restrict to a large subset on which it is cancellative, by using the following proposition. 

Proposition B.8. Let G be a symmetric local group, and let U be an open symmetric neighbourhood 
of the identity in G such that is well-defined. Then the restriction of G to U is cancellative. 

In particular, the restriction of G to the open symmetric neighbourhood Uq discussed earlier is 
cancellative. We shall see later that the property of being cancellative is hereditary in that it is 
inherited by passing to subgroups and quotients, and because of this we will be able to easily restrict 
attention to the cancellative case in our arguments. 

Proof. If g, h U, then {gh)~^ghh~^g^^ is well-defined in G. By evaluating this well-defined expression 
in two different ways we conclude property (iii). In a similar spirit, by evaluating g~^gh and g~^gk for 
g,h,k G U in two different ways, we obtain (i); and similarly for (ii). □ 

Lemma B.9. Let G he a symmetric local group, and let U, V be open sets with id S V. Then U C U -V 
ifU-V is well-defined, and similarly U CV ■ U ifV-Uis well-defined. 

Proof. We prove the first claim only, as the second is similar. Suppose that g is an adherent point of U. 
By continuity, we can find an open neighbourhood W of g and an open neighbourhood Y of the identity 
such that g ■ g~^ ■ W ■ is well-defined and C V. By continuity, the set {h&W : g~^h G Y} is 
an open neighbourhood of g, and thus contains an element h of U. Writing v := g~^h and expanding 
out g-g~^ ■h-v~^ in two different ways, we conclude that g = hv~^, and thus g G U -V as required. □ 

We can give the class of local groups the structure of a category by defining the notion of a (con- 
tinuous) homomorphism. 

Definition B.IO (Homomorphisms). Let G,H,K be symmetric local groups. A continuous homo- 
morphism (j) : G ^ H ]s & continuous map from G to H with the following properties: 
(i) (j) maps the identity of G to the identity of H: (/)(idG) = idn- 
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(ii) For every g ^ G, we have 0(g) ^ = (j>{g ^). 

(iii) If 5, /i e G are such that g ■ h is well-defined, then (j){g) ■ (j){h) is well-defined and is equal to 

We will often omit the adjective "continuous" when G is discrete. 

A local homomorphism from G to iJ is a continuous homomorphism 4> : U ^ H from a symmetric 
open neighbourhood U of the identity of G to H, where of course we give U the structure of the 
restricted local group G [jj from Example [211 Two local homomorphisms (j> : U H, (/>' : U' H 
are equivalent if there exists a neighbourhood V of the identity contained in both U and C/' such that 
(f) and (j)' agree on V] this is an equivalence relation. A local morphism is an equivalence class of local 
homomorphisms . 

Given two local homomorphisms (j) : U ^ H and ip : V K from G to H and H to K respectively, 
we define the composition map ^o(f) : U' ^ K hy ipo(j){g) := ^{(j){g)), where U' := {g £ U : (f){u) £ V}. 
This allows one to define a composition of two local morphisms in the obvious manner. 

Example 29. There are no non-trivial global morphisms from the unit circle M/Z to M. However, there 
do exist non-trivial local morphisms, such as (the equivalence class of) the map (/> from (—1/4, 1/4) 
mod 1 to K defined by setting (j){x mod 1) := x for all x G (—1/4,1/4). The concept of a local 
homomorphism is closely related to that of a Freiman homomorphism in additive combinatorics, as 
discussed for example in [53] . 

One easily verifies that continuous homomorphisms and local morphisms both obey the axioms of a 
category; in particular, the composition of two continuous homomorphisms is a continuous homomor- 
phism, and the composition of two local morphisms is again a local morphism. As usual in category 
theory, we can now say that two local groups G, G' are locally isomorphic if there exists a local mor- 
phism (j) from G to G' with an inverse 0' from G' to G which is also a local morphism, such that the 
compositions (j) o tp' ov (j)' o (p are equivalent to the identity. Thus, for instance, the unit circle R/Z 
and the line K are locally isomorphic. This notion of local isomorphism generalises the notion of local 
identity from Remark BBl 

Definition B.ll (Sub-local groups 23 ). Given two symmetric local groups G' and G, we say that 
G' is a sub-local group of G if G' is the restriction of G to a symmetric neighbourhood of the identity, 
and there exists an open neighbourhood V of G' with the property that whenever g,h € G' are such 
that gh is defined in V, then gh £ G'] we refer to V as an associated neighbourhood for G'. If G' is 
also a global group, we say that G' is a subgroup of G. 

If G' is a sub-local group of G, we say that G' is normal if there exists an associated neighbourhood 
V for G' with the additional property that whenever g' £ G' ,h G V are such that hg'h~^ is well-defined 
and lies in V, then hg'h^^ e G'. We call V a normalising neighbourhood of G'. 

Example 30. If G, G' are the (additive) local groups G := {-2, -1, 0, +1, +2} and G' := {-1,0,-^1}, 
then G' is a sub- local group of G (with associated neighbourhood V — G'). Note that this is despite G' 
not being closed with respect to addition in G; thus we see why it is necessary to allow the associated 
neighbourhood V to be strictly smaller than G. In a similar vein, the open interval (—1, 1) is a sub-local 
group of (-2,2). 
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The interval (—1, 1) x {0} is also a sub-local group of R^; here, one can take for instance (—1, 1)^ 
as the associated neighbourhood. As all these examples are abelian, they are clearly normal. 

Example 31. Let T : V ^ V he a, linear transformation on a finite-dimensional vector space and let 
G := Z Kt V he the associated semi-direct product. Let G' :— {0} x W, where is a subspace of V 
that is not preserved by T. Then G" is not a normal subgroup of G, but it is a normal sub-local group 
of G, where one can take {0} x as a normalising neighbourhood of G". 

Observe that any sub-local group of a cancellative local group is again a cancellative local group. 

One also easily verifies that if : [/ — > iJ is a local homomorphism from G to H for some open 
neighbourhood U of the identity in G, then ker((/)) is a normal sub-local group of U, and hence of G. 
Note that the kernel of a local morphism is well-defined up to local identity. If H is Hausdorff, then 
the kernel ker((/)) will also be closed. 

Conversely, normal sub-local groups give rise to local homomorphisms into quotient spaces. 

Lemma B.12 (Quotient spaces |23)). Let G be a cancellative local group, and let H be a normal 
sub-local group with normalising neighbourhood V. Let W be a symmetric open neighbourhood of 
the identity such that C V. Then there exists a cancellative local group W/H and a surjective 
continuous homomorphism (f) : W W/H such that, for any g,h ^ W , one has 4'{g) — (j){h) if and 
only if gh^^ G H , and for any E C W/H , one has E open if and only if (f)^^{E) is open. 

Proof. We define an equivalence relation on W by declaring g ^ h\i gh'^^ £ H. Using the cancellative 
properties of V (and hence of W^) we see that this is indeed an equivalence relation. We let W/H := 
{[g]^ : g £ W} he the set of equivalence classes [g]^ :— {h £ W : g ^ h}, with the obvious projection 
map IT : W W/H. We define an inversion relation on W/H by setting [g]Z^ ■— [9^^]~, and a 
product operation by setting to equal [g'h']r^ if g'h' e W for at least one representative g' , h' 

of [g]r^, [h]^ respectively. 

We now verify that these relations are well-defined. To make the inversion relation well-defined, 
we need to verify that if g ~ h, then g~^ ~ h~^. But from the cancellative properties of W^, we 
have g"^(h^^)^^ = g^^{gh^^)^^g, and the claim follows as W^ is a normalising neighbourhood for H. 
Similarly, to make the multiplication relation well-defined, we need to verify that if g,g', h, h' are such 
that g^g',h^ h', and gh,g'h' e W, then gh ^ g'h'. But {gh){g'h')-^ = {g{g')-^)g' {h{h')-^){g')-\ 
and the claim follows as W^ is a normalising neighbourhood for H . Similar arguments (which we omit) 
show that W/H obeys the identity, inverse, and local associativity axioms. 

Next, we give W/H the quotient topology, declaring a set E in W/H open iff its inverse image 
7r^^(i?) is open in W (or equivalently, in G). One easily verifies that W/H becomes a symmetric local 
group, and the claim follows. □ 

Example 32. Let G be the additive local group G := (—2,2)^, and let H he the sub-local group 
H := {0} X (—1, 1), with normalising neighbourhood V := (—1, 1)^. If we then set W := (—0.1,0.1)^, 
then the hypotheses of Lemma [6.121 are obeyed, and W/H can be identified with (—0.1,0.1), with the 
projection map : (x, y) i— > x. 

Example 33. Let G be the torus (R/Z)^, and let H be the sub-local group H = {{x, ax) mod : a; G 
(—0.1,0.1)}, where < a < 1 is an irrational number, with normalising neighbourhood (—0.1,0.1)^ 
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mod Tl? . Set W := (—0.01,0.01)^ mod 1? . Then the hypotheses of Lemma fB.12l are again obeyed, 
and WjH can be identified with the interval / := (— O.Of(f + Q;),0.01(f + a)), with the projection 
map : (x, y) mod 1? ^ y — ax for {x,y) £ (—0.01,0.01)^. Note, in contrast, that if one quotiented 
G by the global group (H) ~ {{x,ax) mod : a; e M} generated by H, the quotient would be a 
non-Hausdorff space (and would also contain a dense set of torsion points, in contrast to the interval 
I which is "locally torsion free"). It is because of this pathological behaviour of quotienting by global 
groups that we need to work with local group quotients instead. 

Remark B.13. As we have seen in the above discussion, many familiar concepts in (global) group 
theory have analogues in the local group setting. We will however mention one important global 
group-theoretic concept that does not have a convenient local analogue, and that is the notion of the 
global group (A) generated by a set A of generators. The problem is that this global group (A) consists 
of words in A of arbitrarily length, whereas in a local group one can typically only multiply together a 
bounded number of elements oi A. However, sets such as A™ or (AuA^^ U{id})™ for various choices of 
exponent m can sometimes serve as a partial substitute for this concept in local group theory, though 
one of course has to keep track of the precise value of m throughout the argument. 

Locally compact local groups. Recall that a topological space X is said to be locally compact 
if and every point in X has a compact neighbourhood. In particular, one can speak of a locally compact 
symmetric local group. 

To verify local compactness of a symmetric local group, it suffices to do so at the identity. 

Lemma B.14. Let G be a symmetric local group. Then G is locally compact if and only if there is a 
compact symmetric neighbourhood of the identity. 

Proof. [23, Lemma 2.16] The "only if" part is clear (since id already has a compact neighbourhood). 
Now we turn to the "if" part. Let K he a compact symmetric neighbourhood of the identity. By 
continuity, there exists an open neighbourhood V oi g such that g ■ g~^ ■ V ■ V is well-defined and 
■ V ■ V C K . In particular, h H> g^^h is a homeomorphism from V ■ V to g^^ ■ V ■ V which is 
inverted by the map k i— > gk. By Lemma lB.9[ we conclude that h i~>- g^^h is also a homeomorphism 
from V to g^^ ■ V = g^^ ■ V. In particular, since g^^ ■ V is a, closed subset of K, it is compact, and so 
V is compact also. Thus g has a precompact neighbourhood as required. □ 

Corollary B.15. If G is a locally compact symmetric local group, and U is a symmetric open neigh- 
bourhood of the identity, then U is also a locally compact local group. 

Proof. By Lemma IB. 141 G contains a symmetric precompact open neighbourhood V of the identity. 
By continuity, one can find a symmetric open neighbourhood W of the identity such that ly • is 
well-defined inVDU. By Lemma FB.QI we conclude that the closure W in U is the same as the closure 
of Vl^ in G; as it is contained in the precompact set V, it is thus precompact. The claim then follows 
from another application of Lemma [6.141 □ 

An important subclass of the locally compact local groups are the (symmetric) local Lie groups, 
defined as those (symmetric) local groups which are also smooth finite-dimensional real manifolds. 
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such that the group operations are smooth on their domain of definition. We have the following basic 
theorem. 

Theorem B.16 (Lie's third theorem). Every local Lie group is locally isomorphic to a global Lie group. 
Furthermore, one can take the global Lie group to be both connected and simply connected. 

See e.g. [49] for a proof. 

We have the following deep structure theorem for locally compact global groups, due to Gleason 
and Yamabe [60] . 

Theorem B.17 (Gleason- Yamabe) . Suppose that G is a locally compact global group. Then there is 
an open subgroup G' of G with the following property: inside any neighbourhood of the identity U Q G' , 
there is a compact normal subgroup H such that G' / H is isomorphic to a connected global Lie group. 

The analogous theorem for locally compact local groups was established more recently by Goldbring. 

Theorem B.18 (Goldbring). Suppose that G is a locally compact local group. Then some restriction G' 
of G to a symmetric neighbourhood of the identity has the following property. Inside any neighbourhood 
of the identity U C G' , there is a compact normal subgroup H such that G' / H is isomorphic to a local 
Lie group. 

Proof. The only self-contained proof of Theorem IB. 181 in the literature is in the thesis [25 , where it 
follows from a combination of §4.5 and [22j Proposition 4.7.1]. A more easily accessible account of 
essentially the same material follows by combining [SS] Proposition 4.1] (reduction to the NSS case) 
with [23] §8] (treatment of the NSS case). Alternatively (though ultimately more circuitously) one may 
apply the main result of [55] . which shows that G has a restriction in common with a global locally 
compact group, followed by Theorem IB. 171 For our applications, we only need to apply Theorem lB.181 
when G is metrisable, although the general case can be deduced from the metrisable case without much 
effort. □ 

Appendix C. Nilprogressions and related objects 

In this appendix we prove two basic facts about coset nilprogressions in normal form, namely that 
after shrinking the length parameter slightly they are approximate groups, and are globalisable; that 
is to say isomorphic to subsets of a global group. The proofs of these facts are quite short due to the 
strength of the normal form axioms. One can establish similar assertions without the normal form 
hypothesis, but the arguments are much more complicated in that they require one to work with an 
explicit basis for the free nilpotent group. They are not needed in this paper. 

Lemma C.l. Let P = Pff (ui, . . . , Ur', Ni, . . . , Nr) be a coset nilprogression in C-normal form. Then 
for all e > that are sufficiently small depending on r, C, one has 

(C.l) (l + iVi)...(l + iV,,)|i/| «,,c,r \PH{ui,...,Ur;eNi,...,eNr)\ «c (l + Ni) . . . {I + Nr)\H\. 
and hence, by the volume bounds on P, 

\PH{ui,...,Ur;eNi,...,eNr)\ :^e,c,r \P\- 
Furthermore, Ph{ui, . . . , Ur', eNi, . . . , eNr) is a O^^c,r{^)-o-ppfoximate group. 
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Proof. By quotienting out the finite group H, wliicli is normalised by Ph{ui, . . . ,Ur',eNi, . . . ,£Nr)^ 
(say) if £ is small enough, we may assume that H is trivial. The upper bound in (IC.ip is then 
immediate from the upper bound in (j2.2p . while the lower bound follows from the local properness 
axiom in Definition 12.61 

From (jC.l|) and the Ruzsa covering lemma we see that for e small enough, 

Pff(wi,...,u,;2eiVi,...,2eiV,) 

is covered by Oe,c,r(l) translates of Ph{ui, ■ • ■ , Ur] sNi, . . . , eNr), and so the final claim follows from 
Lemma [5. II □ 

Remark C.2. It is in fact possible to show that \Ph{ui, . . . , Ur', eNi, . . . , £Nr)\ decays at a polynomial 
rate in e, and that Ph{ui, . . . , Ur] sNi, . . . , eNr) is a Oc,r(l)-appi'Oximate group uniformly in e, but we 
will not need these stronger conclusions here. 

Lemma C.3. Let P = Pff (ui, . . . , m^; A^i, . . . , N,,) be a coset nilprogression in C-normal form. Then 
for all e > that are sufficiently small depending on r,C, the set Ph{ui, . . . ,Ur]eNi, . . . ,eNr) is 
isomorphic to a subset of a global group G. 

From this lemma (and Lemma IC.1[) we see that Theorem 12.131 follows immediately from Theorem 

Proof. We first establish the claim under the additional hypothesis that the Ni, . . . , Nr are sufiiciently 
large depending on r, C; we will remove this hypothesis at the end of the argument. 

Let vi, . . . ,Vr be lifts of the generators Ui, . . . , oi P / H to P . By Definition 12 .61 and the normality 
of H, one has 

(C.2) K V,] e P (.,.r, Oc(|^), . . . , Oci^)) H 

for all I ^ i < j ^ r; note that the hypothesis that the Ni are large ensure that the right-hand side is 
well-defined in P. 

Consider a word in P . . . , Wr-; <^c(^^), ■ • ■ , Oc( j^^. )^ , which therefore contains Oci^jp) 

copies of w^^, Oc ( j^^j^. ) copies of i'^2' ^^"^ ^'^ forth. Let us the leftmost copy of v^_^-^^ and move it 
all the way to the left. Each time it passes through a for some j -I- 1 < fc ^ r, we ue (jC.2[) and 
create Qc( jv-^Vfc ) copies of v^^ for each I > k, plus an element of H which can be pushed all 
the way to the right using the normality of H. Thus, if one initially had Qfc J^^. copies of v^^ for each 
j + 1 < fc ^ r before one started moving the leftmost vf_^i to the left, then by the end of the move, 
one would have 

Ni ( akNk Ni 

^^■^^ ""'m]^ ''I ^ N,Nj N,+,N, 

' ^ \j+l<k<l * ^ " 

copies of vf^ for each j + 1 <l ^r. We may simplify the expression (jC.3[) as 

^ 1 ± \ \ Ni 

ai + Oc \ y , -Tr afc 



^ . iV,+i I I N,Nj 



yj + l<k<l 
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Thus we have effectively replaced the sequence (afc)j+i<fc^r by the sequence 

We iterate this process Oc ( ^'n^. ) — Oc{Nj+i) times, and note that the Ofc were initially of size Oci^), 
and end up at a sequence, all of whose entries are of size Oc,r(l)- In other words, after moving all 
copies of v^_^-^ to the left, and all copies of H to the right, we end up with Oc,r{ j^^. ) copies of in 
the middle for each j + 1 < k ^ r. We conclude that 

for some n^jj+i = Oc{jfjf-)] note that as long as the Ni are large enough, all words that appear in 
this reorganisation will lie inside P and so the algebraic manipulations can be justified. Iterating this 
procedure r — j times (which will be justified if the Ni are large enough) we see that 

(C.4) K,i;,]=«;;r+\.. 

for some 71^^,^ — Oc,ri j^^. ) and hij e H. Also, one has 

(C.5) v.hvA^Mh) 

for some (outer) automorphism (f>i : H ^ H oi H. 

Now let G be the global group generated by H and formal generators ei,. ..,er, subject to the 
relations 

(C.6) [e„e,] =e;;f+\..e^-ft,,, 

and 

(C.7) e,heA = Mh) 

for 1 ^ i < j !^ r. We claim that for e small enough, there is an injective homomorphism from 
Ph{ui, . . . , Ur; eNi, . . . , eNr) to G, which will give the claim. 
To see this, first observe from the normality of H that 

Ph{ui, ...,Ur; eNi, eNr) = P{vi,. ■ . ,Vr] eNi, eNr)H. 

Organising the words in P(wi, . . . , u^; £-^1, ■ • • , £-^r) by moving all occurrences of vi to the left (using 
(jC.4p ') and all occurrences of H to the right (using the normality of H) wc then have 

(C.8) Ph{uu. ..,ur; eNi, . . . , eNr) C P{vi;eNi)P{v2, . . . , w.; Oc,r(£^2), • ■ • , OcA£Nr))H 

assuming e is small enough in order to justify all the algebraic manipulations. Iterating this we see 
that 

(C.9) PH{ui,...,Ur;eNi,..., eNr) C P{vi;OcA^Ni)) . . . F(«,; OcA^Nr))H. 
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Thus it suffices to establish an injective homomorphism (j) from the set 
(C.f 0) . . . v^-^h : = OcA^N,)- h e H} 

to G. From the local properness property in Definition 12.61 all the products in (|C.10p are distinct if e 
is small enough. We may thus define (f) by the formula 

Next, we show that (j) is injective. Indeed, suppose that there exist ni^n[ — Oc.ri^Ni) and h,h' £ H 
with 

and thus 

...e:?-/i = ei'v..e"^/i'. 

By the universal properties of G, there is a homomorphism from G to Z that maps ei to 1 and 
annihilates the other and H. This implies that ni — n[. We can then eliminate ni,n'i and work 
with the subgroup G2 of G generated by 62, . . . , and H. From abstract nonsense we see that G2 is 
universal with respect to the constraints (jC.6p . ()C.7|) for i ^ 2, and that G is the semidirect product of 
G2 with Z using the conjugation action of ei on G2 defined using (jC.6|) . (|C.7p for i = 1. In particular, 
there is a homomorphism from G2 to Z that maps 62 to 1 and annihilates the Ci and H for i > 2. 
This gives n2 = n^- Continuing in this fashion we see that Ui = n'^ for all i and hence h = h' , which 
establishes injectivity. 

Finally, we need to show that is a homomorphism. It suffices to show that if rii, n^, n" = Oc,r{sNi) 
and h,h' ,h" € H are such that 

(C.ll) <^ . . . . . Vr' h' = . . . Vr"h" 

then 

(C.12) . . . e^/ief^ . . . e"^/i' = ef . . . e""/i". 

To see this, we rearrange the word on the left-hand side of (jC.lip by moving all occurrences of vi to 
the left, and all occurrences of elements of H to the right, using (jC.4l) and (|C.5|) : if e is small enough, 
then all manipulations take place inside P and can thus be justified. Iterating this process, we must 
eventually be able to express this word in the form u"^ . . . v"''h for some hi = Oc.risNi) and h E H. 
By injectivity, we then have fii = n" and h = h'l . But then if one formally replaces all the Vi by and 
uses (jC.6p . (jC.7|) in place of (jC.4[) . (IC.5|) in the rearrangement procedure just described, we conclude 
(jC.12p . and the claim follows. 

Now we remove the hypothesis that the Ni, . . . , Nr are sufficiently large depending on r, C. Let 
F : M+ — > M+ be a function depending on r, C to be chosen later. By the pigeonhole principle, we can 
find a threshold M > 1 with M — 0_f(1) such that every length Ni is either less than M, or larger 
than F{M). If we let 1 ^ ii < . . . < ^ r be those indices ij with Ni. > F{M), then we see (if 
F is sufficiently rapidly growing) that Ph {ui-^ , ■ • ■ , Ui^ , ; iV^ ^ , . . . , Ni^ ) will be a coset nilprogression in 
Oc,r,M (l)-normal form. For F sufficiently rapidly growing, the preceding argument then applies to 
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conclude that Pniui^ , ■ • ■ , Ui^, ; eNi^ , ■ • ■ , e-^v) is isomorphic to a subset of a global group if e is small 
enough depending on C,r, M, and the claim follows. □ 

Remark C.4. From (jC.9[) we see that every element in Ph{ui, . . . , u^; eA^i, . . . , sNr) takes the form 

(C.13) v^'...v^-h 

for some integers a^, . . . ,0^ with = Ocri^Ni) and h Cz H. Conversely, it is clear that if \ai\ ^ eA^,; 
then all expressions of the form (IC.13|) lie in Pu{ui, . . . , Uj.; eNi, . . . , eNj.). Informally, we thus see that 
the nilprogression Ph{ui, . . . , Ur] eNi, . . . , eNr) is comparable in some sense to the nilbox 

KV..<-/i : |a,| ^eNf,heH}. 

We will however not exploit this description of nilprogressions in this paper. 

A variant of the above analysis also gives polynomial growth of progressions in C-normal form in 
the global case. 

Proposition C.5 (Polynomial growth). Let P = Ph{ui, . . . , Ur] iVi, . . . , N^) he a coset nilprogression 
in C-normal form in a global group. Then for all 1, one has |P™| <$^c.r m'^'^-''^^-' |P| . 

Proof. We allow all implied constants to depend on C, r. As H is normalised by P, we may quotient 
out by H and reduce to the case when H is trivial. Then 

C P{ui, . . .,Ur;mNi, . . . ,mNr) 

and so it suffices (by the volume bound (|2.2p ) to show that 

|P(ui, ...,ur] mNi, mNr)\ < m'^^'^^{Ni + 1) . . . {Nr + 1). 

By modifying the proof of (|C.8|) , one easily verifies that 

P(ui, ...,Ur; niNi, niNr) C P{vi;mNi)P{v2, ...,Vr; ©(m^iVa), . . . , 0{m^Nr)); 

iterating this, one sees that 

P{ui, ...,Ur; miVi, . . . , mNr) C P{vi;0{m'="^^^)Ni) . . . P{vr] 0{m°^^^ Nr)), 

and the claim follows. □ 
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